Difference between revisions of "Science Agents"

From GISAXS
Jump to: navigation, search
(Adapting LLMs to Science)
(AI/ML Methods in Science)
 
(5 intermediate revisions by the same user not shown)
Line 42: Line 42:
 
* 2023-12: [https://www.nature.com/articles/s41524-024-01423-2 Opportunities for retrieval and tool augmented large language models in scientific facilities]
 
* 2023-12: [https://www.nature.com/articles/s41524-024-01423-2 Opportunities for retrieval and tool augmented large language models in scientific facilities]
 
* 2023-12: [https://arxiv.org/abs/2312.17180 Virtual Scientific Companion for Synchrotron Beamlines: A Prototype]
 
* 2023-12: [https://arxiv.org/abs/2312.17180 Virtual Scientific Companion for Synchrotron Beamlines: A Prototype]
 +
* 2023-12: [https://www.nature.com/articles/s41586-023-06792-0 Autonomous chemical research with large language models]
 
* 2024-01: [https://iopscience.iop.org/article/10.1088/2632-2153/ad52e9 Synergizing Human Expertise and AI Efficiency with Language Model for Microscopy Operation and Automated Experiment Design]
 
* 2024-01: [https://iopscience.iop.org/article/10.1088/2632-2153/ad52e9 Synergizing Human Expertise and AI Efficiency with Language Model for Microscopy Operation and Automated Experiment Design]
 +
* 2024-06: [https://pubs.rsc.org/en/content/articlelanding/2025/dd/d4dd00143e From Text to Test: AI-Generated Control Software for Materials Science Instruments]
 
* 2024-12: [https://arxiv.org/abs/2412.18161 VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities]
 
* 2024-12: [https://arxiv.org/abs/2412.18161 VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities]
 +
* 2025-01: [https://www.science.org/doi/10.1126/sciadv.adr4173 Large language models for human-machine collaborative particle accelerator tuning through natural language]
  
 
==AI/ML Methods tailored to Science==
 
==AI/ML Methods tailored to Science==
Line 92: Line 95:
 
* [https://x.com/vant_ai/status/1903070297991110657 2025-03]: [https://www.vant.ai/ Vant AI] [https://www.vant.ai/neo-1 Neo-1]: atomistic foundation model (small molecules, proteins, etc.)
 
* [https://x.com/vant_ai/status/1903070297991110657 2025-03]: [https://www.vant.ai/ Vant AI] [https://www.vant.ai/neo-1 Neo-1]: atomistic foundation model (small molecules, proteins, etc.)
 
* 2025-03: [https://arxiv.org/abs/2503.16351 Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences]
 
* 2025-03: [https://arxiv.org/abs/2503.16351 Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences]
 +
 +
===Medicine===
 +
* 2025-04: [https://arxiv.org/abs/2504.05186 Training state-of-the-art pathology foundation models with orders of magnitude less data]
  
 
===Successes===
 
===Successes===
Line 121: Line 127:
 
* 2025-02: [https://arxiv.org/abs/2502.20309 EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants]
 
* 2025-02: [https://arxiv.org/abs/2502.20309 EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants]
 
* 2025-03: [https://huggingface.co/datasets/futurehouse/BixBench BixBench]: Novel hypotheses (accept/reject)
 
* 2025-03: [https://huggingface.co/datasets/futurehouse/BixBench BixBench]: Novel hypotheses (accept/reject)
 +
* 2025-04: [https://research.google/blog/evaluating-progress-of-llms-on-scientific-problem-solving/ Google: Evaluating progress of LLMs on scientific problem-solving]
 +
** 2025-03: [https://arxiv.org/abs/2503.13517 CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning]
 +
** 2024-07: [https://arxiv.org/abs/2407.09413 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers]
 +
** 2024-10: [https://neurips.cc/virtual/2024/98540 FEABench: Evaluating Language Models on Real World Physics Reasoning Ability]
  
 
=Science Agents=
 
=Science Agents=
Line 138: Line 148:
 
* 2024-12-30: [https://arxiv.org/abs/2412.21154 Aviary: training language agents on challenging scientific tasks]
 
* 2024-12-30: [https://arxiv.org/abs/2412.21154 Aviary: training language agents on challenging scientific tasks]
 
* See also: [[AI_Agents#Deep_Research|AI Agents > Deep Research]]
 
* See also: [[AI_Agents#Deep_Research|AI Agents > Deep Research]]
 +
* 2025-04-08: Sakana: [https://pub.sakana.ai/ai-scientist-v2/paper/paper.pdf The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search] ([https://github.com/SakanaAI/AI-Scientist-v2 code])
  
 
==Science Multi-Agent Setups==
 
==Science Multi-Agent Setups==
 
* 2025-01: [https://arxiv.org/abs/2501.04227 Agent Laboratory: Using LLM Agents as Research Assistants]
 
* 2025-01: [https://arxiv.org/abs/2501.04227 Agent Laboratory: Using LLM Agents as Research Assistants]
 +
* 2025-04: [https://www.nature.com/articles/s41551-025-01363-2 Coordinated AI agents for advancing healthcare] ([https://www.nature.com/articles/s41551-025-01363-2.epdf?sharing_token=CIYP3J8LZE4BX31fV3WxUdRgN0jAjWel9jnR3ZoTv0O9iD-yhgqzRaz_7VASayWRePPhWDD2xFyfuOpSXbdPaOtt7oH4nfXo7telALzNwY3V1p9SxoqBEJy2OuaJ_cA35-CYQC1XgjCNTZUw46dh1KX-Dj8e7-1Vk_RlZKFLrc8%3D pdf])
  
 
=AI Science Systems=
 
=AI Science Systems=
Line 179: Line 191:
 
==Chemistry==
 
==Chemistry==
 
* 2025-03: [https://jcheminf.biomedcentral.com/articles/10.1186/s13321-024-00834-z Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices] ([https://rxn-insight.readthedocs.io/en/latest/ docs])
 
* 2025-03: [https://jcheminf.biomedcentral.com/articles/10.1186/s13321-024-00834-z Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices] ([https://rxn-insight.readthedocs.io/en/latest/ docs])
 +
 +
=Science Datasets=
 +
* [https://github.com/blaiszik/awesome-matchem-datasets/ Awesome Materials & Chemistry Datasets]
  
 
=See Also=
 
=See Also=
 
* [[AI agents]]
 
* [[AI agents]]
 
* [https://nanobot.chat/ Nanobot.chat]: Intelligent AI for the labnetwork @ mtl.mit.edu forum
 
* [https://nanobot.chat/ Nanobot.chat]: Intelligent AI for the labnetwork @ mtl.mit.edu forum

Latest revision as of 12:03, 8 April 2025

AI Use-cases for Science

Literature

LLM extract data from papers

AI finding links in literature

(Pre) Generate Articles

Explanation

Autonomous Ideation

Adapting LLMs to Science

AI/LLM Control of Scientific Instruments/Facilities

AI/ML Methods tailored to Science

Regression (Data Fitting)

Tabular Classification/Regression

Symbolic Regression

Literature Discovery

Commercial

AI/ML Methods in Science

Chemistry

Biology

Medicine

Successes

AI/ML Methods co-opted for Science

Mechanistic Interpretability

Train large model on science data. Then apply mechanistic interpretability (e.g. sparse autoencoders, SAE) to the feature/activation space.

Uncertainty

Science Benchmarks

Science Agents

Reviews

Specific

Science Multi-Agent Setups

AI Science Systems

Inorganic Materials Discovery

Chemistry

LLMs Optimized for Science

Impact of AI in Science

Related Tools

Literature Search

Data Visualization

Generative

Chemistry

Science Datasets

See Also