Difference between revisions of "Science Agents"

From GISAXS

Jump to: navigation, search

Latest revision as of 09:59, 10 May 2025

Contents

1 AI Use-cases for Science
2 Science Benchmarks
3 Science Agents
4 AI Science Systems
5 Impact of AI in Science
6 Related Tools
7 Science Datasets
8 See Also

AI Use-cases for Science

Literature

alphaXiv | Explore: Understand arXiv papers

LLM extract data from papers

2024-14: From text to insight: large language models for chemical data extraction

AI finding links in literature

(Pre) Generate Articles

2022-12: Re3: Generating Longer Stories With Recursive Reprompting and Revision
2023-03: English essays: Artificial intelligence (AI) technology in OpenAI ChatGPT application: A review of ChatGPT in writing English essay
2023-01: Journalism: Collaborating With ChatGPT: Considering the Implications of Generative Artificial Intelligence for Journalism and Media Education
2023-07: Science writing: Artificial intelligence in scientific writing: a friend or a foe?
2024-02: Wikipedia style: Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
2024-02: LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs (code)
2024-08: Scientific papers: The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
2024-09: PaperQA2: Language Models Achieve Superhuman Synthesis of Scientific Knowledge (𝕏 post, code)
2025-03: Reasoning to Learn from Latent Thoughts
2025-03: WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
2025-04: Sleep-time Compute: Beyond Inference Scaling at Test-time

Explanation

2025-02: TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding (preprint)
2025-04: Do Two AI Scientists Agree?

Autonomous Ideation

Adapting LLMs to Science

AI/LLM Control of Scientific Instruments/Facilities

AI/ML Methods tailored to Science

Regression (Data Fitting)

2024-06: Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data: training on (x,y) pairs enables inferring underlying function (define it in code, invert it, compose it)
2024-12: OmniPred: Language Models as Universal Regressors

Tabular Classification/Regression

2025-01: Accurate predictions on small data with a tabular foundation model (code)

Symbolic Regression

2024-09: Symbolic Regression with a Learned Concept Library

Literature Discovery

Commercial

Sakana AI
Cusp AI: Materials/AI
Lila AI: Life sciences
Radical AI: Material simulation/design
Autoscience (Carl)

Bio

AI/ML Methods in Science

Chemistry

2025-01: Large language models for reticular chemistry
2025-02: Image-based generation for molecule design with SketchMol
2025-02: Large language models for scientific discovery in molecular property prediction
2025-03: Vant AI Neo-1: atomistic foundation model (small molecules, proteins, etc.)

Biology

2018: AlphaFold
2021-07: AlphaFold 2
2024-05: AlphaFold 3
2023-03: Evolutionary-scale prediction of atomic-level protein structure with a language model (ESMFold)
2023-11: Illuminating protein space with a programmable generative model
2024-11: Sequence modeling and design from molecular to genome scale with Evo (Evo)
2025-01: Targeting protein–ligand neosurfaces with a generalizable deep learning tool (Chroma)
2025-01: Simulating 500 million years of evolution with a language model (ESM 3 model)
2025-02: Genome modeling and design across all domains of life with Evo 2
2025-02: Exploring the structural changes driving protein function with BioEmu-1
2025-02: Protein Large Language Models: A Comprehensive Survey
2025-03: Vant AI Neo-1: atomistic foundation model (small molecules, proteins, etc.)
2025-03: Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences

Medicine

See: AI_Agents#Medicine

Successes

2025-02: Site-Decorated Model for Unconventional Frustrated Magnets: Ultranarrow Phase Crossover and Spin Reversal Transition

AI/ML Methods co-opted for Science

Mechanistic Interpretability

Train large model on science data. Then apply mechanistic interpretability (e.g. sparse autoencoders, SAE) to the feature/activation space.

Mechanistic interpretability for protein language models (visualizer, code, SAE)
Markov Bio: Through a Glass Darkly: Mechanistic Interpretability as the Bridge to End-to-End Biology (quick description, background info on recent bio progress)
2023-01: Tracr: Compiled Transformers as a Laboratory for Interpretability (code)
2024-10: An X-Ray Is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
2024-12: Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models
2024-12: InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
2025-01: Insights on Galaxy Evolution from Interpretable Sparse Feature Networks
2025-02: From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models
2025-02: Interpreting Evo 2: Arc Institute's Next-Generation Genomic Foundation Model

Uncertainty

Science Benchmarks

2024-07: SciCode: A Research Coding Benchmark Curated by Scientists (project)
2024-11: AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions (code)
2024-12: LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
2025-01: Humanity's Last Exam
ScienceAgentBench
2025-02: EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants
2025-03: BixBench: Novel hypotheses (accept/reject)
2025-04: Google: Evaluating progress of LLMs on scientific problem-solving

Science Agents

Reviews

2024-10: Empowering biomedical discovery with AI agents
2025-01: A review of large language models and autonomous agents in chemistry (github)

Specific

2024-01-13: ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization (video)
2024-06-19: LLMatDesign: Autonomous Materials Discovery with Large Language Models
2024-08-12: Sakana AI: AI Scientist; The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery (code)
2024-09-09: SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning (code)
2024-09-11: PaperQA2: Language Models Achieve Superhuman Synthesis of Scientific Knowledge (𝕏 post, code)
2024-10-17: Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems
2024-10-28: Large Language Model-Guided Prediction Toward Quantum Materials Synthesis
2024-12-06: The Virtual Lab: AI Agents Design New SARS-CoV-2 Nanobodies with Experimental Validation (writeup: Virtual lab powered by ‘AI scientists’ super-charges biomedical research: Could human–AI collaborations be the future of interdisciplinary studies?)
2024-12-30: Aviary: training language agents on challenging scientific tasks
See also: AI Agents > Deep Research
2025-04-08: Sakana: The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search (code)

Science Multi-Agent Setups

2025-01: Agent Laboratory: Using LLM Agents as Research Assistants
2025-04: Coordinated AI agents for advancing healthcare (pdf)

AI Science Systems

2025-01: Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
2025-02: Towards an AI co-scientist (Google blog post: Accelerating scientific breakthroughs with an AI co-scientist)

Inorganic Materials Discovery

2023-11: Scaling deep learning for materials discovery
2023-11: An autonomous laboratory for the accelerated synthesis of novel materials
2024-10: Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models (code, datasets, checkpoints, blogpost)
2025-01: A generative model for inorganic materials design
2025-04: System of Agentic AI for the Discovery of Metal-Organic Frameworks

Chemistry

2023-12: Autonomous chemical research with large language models (Coscientist)
2024-11: An automatic end-to-end chemical synthesis development platform powered by large language models

LLMs Optimized for Science

2022-11: Galactica: A Large Language Model for Science
2025-02: MatterChat: A Multi-Modal LLM for Material Science
2025-03: OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery
2025-03: Google TxGemma (2B, 9B, 27B): drug development

Impact of AI in Science

Related Tools

Literature Search

Data Visualization

2024-10: Microsoft Data Formulator: Create Rich Visualization with AI iteratively (video, code)
Julius AI: Analyze your data with computational AI

Generative

2025-03: StarVector 1B, 8B: text or image to SVG

Chemistry

2025-03: Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices (docs)

Science Datasets

Awesome Materials & Chemistry Datasets

See Also

AI agents
Nanobot.chat: Intelligent AI for the labnetwork @ mtl.mit.edu forum

Retrieved from "http://gisaxs.com/index.php?title=Science_Agents&oldid=7744"