Latest revision as of 12:43, 16 April 2026

AI Use-cases for Science

Literature

alphaXiv | Explore: Understand arXiv papers
2026-02: Synthesizing scientific literature with retrieval-augmented language models

LLM extract data from papers

2024-14: From text to insight: large language models for chemical data extraction

AI finding links in literature

(Pre) Generate Articles

2022-12: Re3: Generating Longer Stories With Recursive Reprompting and Revision
2023-03: English essays: Artificial intelligence (AI) technology in OpenAI ChatGPT application: A review of ChatGPT in writing English essay
2023-01: Journalism: Collaborating With ChatGPT: Considering the Implications of Generative Artificial Intelligence for Journalism and Media Education
2023-07: Science writing: Artificial intelligence in scientific writing: a friend or a foe?
2024-02: Wikipedia style: Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
2024-02: LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs (code)
2024-08: Scientific papers: The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
- 2026-04: Towards end-to-end automation of AI research
2024-09: PaperQA2: Language Models Achieve Superhuman Synthesis of Scientific Knowledge (𝕏 post, code)
2025-03: Reasoning to Learn from Latent Thoughts
2025-03: WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
2025-04: Sleep-time Compute: Beyond Inference Scaling at Test-time

Explanation

2025-02: TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding (preprint)
2025-04: Do Two AI Scientists Agree?

Autonomous Ideation

Adapting LLMs to Science

AI/LLM Control of Scientific Instruments/Facilities

AI/ML Methods tailored to Science

Literature Discovery

Commercial

Sakana AI
Cusp AI: Materials/AI
Lila AI: Life sciences
Radical AI: Material simulation/design
Autoscience (Carl)
Periodic Labs
Edison Scientific (drug discovery, spinoff from FutureHouse)
2026-03: Mirendil Inc.: advanced models to speed up R&D in scientific domains, especially biology and materials science

Bio

AI/ML Methods in Science

2025-07: Synthetic Scientific Image Generation with VAE, GAN, and Diffusion Model Architectures

Imaging

2025-05: Behind the Noise: Conformal Quantile Regression Reveals Emergent Representations (blog: Behind the Noise)

Materials

Chemistry

2025-01: Large language models for reticular chemistry
2025-02: Image-based generation for molecule design with SketchMol
2025-02: Large language models for scientific discovery in molecular property prediction
2025-03: Vant AI Neo-1: atomistic foundation model (small molecules, proteins, etc.)
2025-04: Compositional Flows for 3D Molecule and Synthesis Pathway Co-design
2025-07: General purpose models for the chemical sciences
2025-11: ChemTorch: A Deep Learning Framework for Benchmarking and Developing Chemical Reaction Property Prediction Models

Biology

2018: AlphaFold
2021-07: AlphaFold 2
2024-05: AlphaFold 3
2023-03: Evolutionary-scale prediction of atomic-level protein structure with a language model (ESMFold)
2023-11: Illuminating protein space with a programmable generative model
2024-11: Sequence modeling and design from molecular to genome scale with Evo (Evo)
2025-01: Targeting protein–ligand neosurfaces with a generalizable deep learning tool (Chroma)
2025-01: Simulating 500 million years of evolution with a language model (ESM 3 model)
2025-02: Genome modeling and design across all domains of life with Evo 2
2025-02: Exploring the structural changes driving protein function with BioEmu-1
2025-02: Protein Large Language Models: A Comprehensive Survey
2025-03: Vant AI Neo-1: atomistic foundation model (small molecules, proteins, etc.)
2025-03: Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences
2025-08: RosettaFold 3: Accelerating Biomolecular Modeling with AtomWorks and RF3
2025-09: Generative design of novel bacteriophages with genome language models
2025-10: Strengthening nucleic acid biosecurity screening against generative protein design tools
2026-01: Advancing regulatory variant effect prediction with AlphaGenome

Medicine

See: AI_Agents#Medicine

Successes

2025-02: Site-Decorated Model for Unconventional Frustrated Magnets: Ultranarrow Phase Crossover and Spin Reversal Transition

AI/ML Methods co-opted for Science

Mechanistic Interpretability

Train large model on science data. Then apply mechanistic interpretability (e.g. sparse autoencoders, SAE) to the feature/activation space.

Mechanistic interpretability for protein language models (visualizer, code, SAE)
Markov Bio: Through a Glass Darkly: Mechanistic Interpretability as the Bridge to End-to-End Biology (quick description, background info on recent bio progress)
2023-01: Tracr: Compiled Transformers as a Laboratory for Interpretability (code)
2024-10: An X-Ray Is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
2024-12: Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models
2024-12: InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
2025-01: Insights on Galaxy Evolution from Interpretable Sparse Feature Networks
2025-02: From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models
2025-02: Interpreting Evo 2: Arc Institute's Next-Generation Genomic Foundation Model
2026-01: Using Interpretability to Identify a Novel Class of Alzheimer's Biomarkers

Uncertainty

Science Benchmarks

2024-07: SciCode: A Research Coding Benchmark Curated by Scientists (project)
2024-11: AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions (code)
2024-12: LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
2025-01: Humanity's Last Exam
ScienceAgentBench
2025-02: EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants
2025-03: BixBench: Novel hypotheses (accept/reject)
2025-04: Google: Evaluating progress of LLMs on scientific problem-solving
2025-07: SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks (vote, data, code)
2026-02: Edison: LABBench 2
2026-04: LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning (site, code)

Science Agents

Reviews

Challenges

2026-01: Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

Specific

2024-01-13: ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization (video)
2024-06-19: LLMatDesign: Autonomous Materials Discovery with Large Language Models
2024-08-12: Sakana AI: AI Scientist; The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery (code)
2024-09-09: SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning (code)
2024-09-11: PaperQA2: Language Models Achieve Superhuman Synthesis of Scientific Knowledge (𝕏 post, code)
2024-10-17: Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems
2024-10-28: Large Language Model-Guided Prediction Toward Quantum Materials Synthesis
2024-12-06: The Virtual Lab: AI Agents Design New SARS-CoV-2 Nanobodies with Experimental Validation (writeup: Virtual lab powered by ‘AI scientists’ super-charges biomedical research: Could human–AI collaborations be the future of interdisciplinary studies?)
2024-12-30: Aviary: training language agents on challenging scientific tasks
See also: AI Agents > Deep Research
2025-04-08: Sakana: The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search (code)
2025-07: DREAMS: Density Functional Theory Based Research Engine for Agentic Materials Simulation
2025-11: Kosmos: An AI Scientist for Autonomous Discovery
2025-11: SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning
2026-02: PaperBanana: Automating Academic Illustration for AI Scientists
2026-03: AI Agents Can Already Autonomously Perform Experimental High Energy Physics
2026-04: Denario: scientific research assistant

Science Multi-Agent Setups

2025-01: Agent Laboratory: Using LLM Agents as Research Assistants
2025-04: Coordinated AI agents for advancing healthcare (pdf)

Science Agentic Components

Frameworks

Anthropic Claude Agent SKD overview
OpenClaw
OpenCode
OpenHands
LAMM: MIT Laboratory for Atomistic and Molecular Mechanics
- ScienceClaw: Framework for autonomous scientific investigation without central coordination.
- Infinite: The Infinite Corridor of Scientific Discovery. Open science, powered by many — agents and humans discovering together.

Personalities

2026-03: The Agency: AI Specialists Ready to Transform Your Workflow

Skills

2026-03: Claude Scientific Skills (list)

AI Science Systems

2025-01: Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
2025-01: Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents
2025-02: Towards an AI co-scientist (Google blog post: Accelerating scientific breakthroughs with an AI co-scientist)
2025-06: The Discovery Engine
- 2025-07: Benchmarking the Discovery Engine (blog)
2025-07: Autonomous Scientific Discovery Through Hierarchical AI Scientist Systems
2025-12: Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
2026-01: SciSciGPT: advancing human–AI collaboration in the science of science
2026-02: AUTODISCOVERY: Open-ended Scientific Discovery via Bayesian Surprise (Allen AI (Ai2) AstraLabs, blog, tools)

Inorganic Materials Discovery

2023-11: Scaling deep learning for materials discovery
2023-11: An autonomous laboratory for the accelerated synthesis of novel materials
2024-09: HoneyComb: A Flexible LLM-Based Agent System for Materials Science
2024-10: Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models (code, datasets, checkpoints, blogpost)
2025-01: A generative model for inorganic materials design
2025-04: System of Agentic AI for the Discovery of Metal-Organic Frameworks
2025-05: The Open Molecules 2025 (OMol25) Dataset, Evaluations, and Models

Materials Characterization

2025-08: Operationalizing Serendipity: Multi-Agent AI Workflows for Enhanced Materials Characterization with Theory-in-the-Loop

Chemistry

2023-12: Autonomous chemical research with large language models (Coscientist)
2024-09: PNNL ChemAIst V0.2
2024-11: An automatic end-to-end chemical synthesis development platform powered by large language models
2025-06: Training a Scientific Reasoning Model for Chemistry
2025-06: ChemGraph: An Agentic Framework for Computational Chemistry Workflows (code)

Bio

2025-07: BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments

Physics

2025-12: PhysMaster: Building an Autonomous AI Physicist for Theoretical and Computational Physics Research

LLMs Optimized for Science

2022-11: Galactica: A Large Language Model for Science
2024-12: Crystal structure generation with autoregressive large language modeling
2025-02: MatterChat: A Multi-Modal LLM for Material Science
2025-03: OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery
2025-03: Google TxGemma (2B, 9B, 27B): drug development

Impact of AI in Science

Related Tools

Literature Search

Data Visualization

2024-10: Microsoft Data Formulator: Create Rich Visualization with AI iteratively (video, code)
Julius AI: Analyze your data with computational AI

Generative

2025-03: StarVector 1B, 8B: text or image to SVG

Chemistry

2025-03: Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices (docs)

Science Datasets

Genuine Discoveries

Math

2023-07: Faster sorting algorithms discovered using deep reinforcement learning
2025-06: AlphaEvolve: A coding agent for scientific and algorithmic discovery
2025-11: Mathematical exploration and discovery at scale
2025-11: Olympiad-level formal mathematical reasoning with reinforcement learning
2025-12: Extremal descendant integrals on moduli spaces of curves: An inequality discovered and proved in collaboration with AI
AI Solving Erdős Problems:
- 2026-01: Erdős Problem #728 and #729 solved by Aristotle using ChatGPT 5.2 Pro
- 2026-01: Erdős Problem #397 solved by Neel Somani using ChatGPT 5.2 Pro
- 2026-01: Erdős Problem #205 solved by Aristotle using ChatGPT 5.2 Pro
- 2026-01: Erdős Problem #281 solved by Neel Somani using ChatGPT 5.2 Pro
- 2026-01: Google DeepMind: Irrationality of rapidly converging series: a problem of Erdős and Graham
  - Erdős Problem #1051 solved by Google DeepMind Aletheia agent
- 2026-01: Google DeepMind: Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems
  - Attempted 700 problems, solved 13 open Erdős problems: 5 novel autonomous solutions, 8 through existing literature.
- 2026-02: Erdős Problem #846
  - Google DeepMind
  - Using OpenAI internal model (paper: On infinite sets with no 3 on a line)
- 2026-03: Three problems solved using OpenAI GPT internal model. Paper: Short Proofs in Combinatorics and Number Theory
- 2026-04: Erdős Problem #1196 solved by Leeham using ChatGPT 5.4 Pro
- 2026-04: Erdős Problem #258 solved by Przemek Chojecki using ChatGPT 5.4 Pro
2026-01: The motivic class of the space of genus 0 maps to the flag variety
2026-02: Google DeepMind: Towards Autonomous Mathematics Research
2026-03: Donald Knuth: A problem in Directed Hamiltonian Cycles solved by Filip Stappers using Claude Opus 4.6
2026-03: Google DeepMind: Reinforced Generation of Combinatorial Structures: Ramsey Numbers
2026-03: FrontierMath problem: "A Ramsey-style Problem on Hypergraphs" solved by Kevin Barreto and Liam Price using GPT-5.4 Pro

Physics assistance

Literature exploration

2025-11: Kosmos: An AI Scientist for Autonomous Discovery (Edison)

Bio design

2023-07: De novo design of protein structure and function with RFdiffusion
2025-11: Atomically accurate de novo design of antibodies with RFdiffusion
2025-11: AlphaFold: Five years of impact
2026-01: Using Interpretability to Identify a Novel Class of Alzheimer's Biomarkers

Material Discovery

2023-11: Scaling deep learning for materials discovery

@@ Line 4: / Line 4: @@
 ==Literature==
 * [https://www.alphaxiv.org/explore alphaXiv | Explore]: Understand arXiv papers
+* 2026-02: [https://www.nature.com/articles/s41586-025-10072-4 Synthesizing scientific literature with retrieval-augmented language models]
 ===LLM extract data from papers===
@@ Line 20: / Line 21: @@
 * 2024-02: [https://arxiv.org/abs/2408.07055 LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs] ([https://github.com/THUDM/LongWriter code])
 * 2024-08: Scientific papers: [https://arxiv.org/abs/2408.06292 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery]
+** 2026-04: [https://www.nature.com/articles/s41586-026-10265-5 Towards end-to-end automation of AI research]
 * 2024-09: PaperQA2: [https://paper.wikicrow.ai/ Language Models Achieve Superhuman Synthesis of Scientific Knowledge] ([https://x.com/SGRodriques/status/1833908643856818443 𝕏 post], [https://github.com/Future-House/paper-qa code])
 * 2025-03: [https://arxiv.org/abs/2503.18866 Reasoning to Learn from Latent Thoughts]
@@ Line 39: / Line 41: @@
 * 2025-06: [https://arxiv.org/abs/2506.00794 Predicting Empirical AI Research Outcomes with Language Models]
 * 2025-06: [https://arxiv.org/abs/2506.20803 The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas]
+* 2026-03: [https://arxiv.org/abs/2603.14473 AI Can Learn Scientific Taste]
 ==Adapting LLMs to Science==
@@ Line 79: / Line 82: @@
 * [https://github.com/TheBlewish/Automated-AI-Web-Researcher-Ollama Automated-AI-Web-Researcher-Ollama]
 * 2025-01: [https://arxiv.org/abs/2501.05366 Search-o1: Agentic Search-Enhanced Large Reasoning Models] ([https://search-o1.github.io/ project], [https://github.com/sunnynexus/Search-o1 code])
+* 2026-02: [https://www.nature.com/articles/s41586-025-10072-4 Synthesizing scientific literature with retrieval-augmented language models] ([https://allenai.org/blog/openscholar-nature blog])
 ===Commercial===
@@ Line 88: / Line 92: @@
 * [https://periodic.com/ Periodic Labs]
 * [https://edisonscientific.com/articles/announcing-edison-scientific Edison Scientific] (drug discovery, spinoff from [https://www.futurehouse.org/ FutureHouse])
+* 2026-03: Mirendil Inc.: advanced models to speed up R&D in scientific domains, especially biology and materials science
 ====Bio====
@@ Line 130: / Line 135: @@
 * 2025-09: [https://www.biorxiv.org/content/10.1101/2025.09.12.675911v1 Generative design of novel bacteriophages with genome language models]
 * 2025-10: [https://www.science.org/doi/10.1126/science.adu8578 Strengthening nucleic acid biosecurity screening against generative protein design tools]
+* 2026-01: [https://www.nature.com/articles/s41586-025-10014-0 Advancing regulatory variant effect prediction with AlphaGenome]
 ===Medicine===
@@ Line 149: / Line 155: @@
 * 2025-02: [https://www.biorxiv.org/content/10.1101/2025.02.06.636901v1 From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models]
 * 2025-02: [https://www.goodfire.ai/blog/interpreting-evo-2 Interpreting Evo 2: Arc Institute's Next-Generation Genomic Foundation Model]
+* 2026-01: [https://www.goodfire.ai/research/interpretability-for-alzheimers-detection# Using Interpretability to Identify a Novel Class of Alzheimer's Biomarkers]
 ===Uncertainty===
@@ Line 166: / Line 173: @@
 ** 2024-07: [https://arxiv.org/abs/2407.09413 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers]
 ** 2024-10: [https://neurips.cc/virtual/2024/98540 FEABench: Evaluating Language Models on Real World Physics Reasoning Ability]
+* 2025-07: [https://allenai.org/blog/sciarena SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks] ([https://sciarena.allen.ai/ vote], [https://huggingface.co/datasets/yale-nlp/SciArena data], [https://github.com/yale-nlp/SciArena code])
+* 2026-02: [https://edisonscientific.com/ Edison]: [https://lab-bench.ai/ LABBench 2]
+* 2026-04: [https://arxiv.org/abs/2604.14140 LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning] ([https://longcot.ai/ site], [https://github.com/LongHorizonReasoning/longcot code])
 =Science Agents=
@@ Line 192: / Line 202: @@
 * 2025-11: [https://arxiv.org/abs/2511.02824 Kosmos: An AI Scientist for Autonomous Discovery]
 * 2025-11: [https://arxiv.org/abs/2511.08151 SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning]
+* 2026-02: [https://arxiv.org/abs/2601.23265 PaperBanana: Automating Academic Illustration for AI Scientists]
+* 2026-03: [https://arxiv.org/abs/2603.20179 AI Agents Can Already Autonomously Perform Experimental High Energy Physics]
+* 2026-04: [https://github.com/AstroPilot-AI/Denario Denario]: scientific research assistant
 ==Science Multi-Agent Setups==
 * 2025-01: [https://arxiv.org/abs/2501.04227 Agent Laboratory: Using LLM Agents as Research Assistants]
 * 2025-04: [https://www.nature.com/articles/s41551-025-01363-2 Coordinated AI agents for advancing healthcare] ([https://www.nature.com/articles/s41551-025-01363-2.epdf?sharing_token=CIYP3J8LZE4BX31fV3WxUdRgN0jAjWel9jnR3ZoTv0O9iD-yhgqzRaz_7VASayWRePPhWDD2xFyfuOpSXbdPaOtt7oH4nfXo7telALzNwY3V1p9SxoqBEJy2OuaJ_cA35-CYQC1XgjCNTZUw46dh1KX-Dj8e7-1Vk_RlZKFLrc8%3D pdf])
+=Science Agentic Components=
+==Frameworks==
+* [https://platform.claude.com/docs/en/agent-sdk/overview Anthropic Claude Agent SKD overview]
+* [https://openclaw.ai/ OpenClaw]
+* [https://opencode.ai/ OpenCode]
+* [https://github.com/OpenHands/software-agent-sdk OpenHands]
+* [https://github.com/lamm-mit?tab=repositories LAMM: MIT Laboratory for Atomistic and Molecular Mechanics]
+** [https://github.com/lamm-mit/scienceclaw ScienceClaw]: Framework for autonomous scientific investigation without central coordination.
+** [https://infinite-lamm.vercel.app/ Infinite]: The Infinite Corridor of Scientific Discovery. Open science, powered by many — agents and humans discovering together.
+==Personalities==
+* 2026-03: [https://github.com/msitarzewski/agency-agents The Agency: AI Specialists Ready to Transform Your Workflow]
+==Skills==
+* 2026-03: [https://github.com/K-Dense-AI/claude-scientific-skills/tree/main?tab=readme-ov-file#use-cases Claude Scientific Skills] (list)
 =AI Science Systems=
@@ Line 206: / Line 235: @@
 * 2025-12: [https://arxiv.org/abs/2512.16969 Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows]
 * 2026-01: [https://www.nature.com/articles/s43588-025-00906-6 SciSciGPT: advancing human–AI collaboration in the science of science]
+* 2026-02: [https://allenai.org/papers/autodiscovery AUTODISCOVERY: Open-ended Scientific Discovery via Bayesian Surprise] (Allen AI (Ai2) AstraLabs, [https://allenai.org/blog/autodiscovery blog], [https://autodiscovery.allen.ai/runs tools])
 ===Inorganic Materials Discovery===
@@ Line 243: / Line 273: @@
 ** 2025-05: Retraction: [https://economics.mit.edu/news/assuring-accurate-research-record Assuring an accurate research record]
 * 2025-02: [https://arxiv.org/abs/2502.05151 Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation]
+* 2026-02: [https://arxiv.org/abs/2602.03837 Accelerating Scientific Research with Gemini: Case Studies and Common Techniques]
 =Related Tools=
@@ Line 267: / Line 298: @@
 * 2025-11: [https://cdn.openai.com/pdf/4a25f921-e4e0-479a-9b38-5367b47e8fd0/early-science-acceleration-experiments-with-gpt-5.pdf Early science acceleration experiments with GPT-5]
 * 2025-12: [https://andymasley.substack.com/p/ai-can-obviously-create-new-knowledge AI can obviously create new knowledge - But maybe not new concepts]
+==Math==
+* 2023-07: [https://www.nature.com/articles/s41586-023-06004-9?utm_source=chatgpt.com Faster sorting algorithms discovered using deep reinforcement learning]
+* 2025-06: [https://arxiv.org/abs/2506.13131 AlphaEvolve: A coding agent for scientific and algorithmic discovery]
+* 2025-11: [https://arxiv.org/abs/2511.02864 Mathematical exploration and discovery at scale]
+* 2025-11: [https://www.nature.com/articles/s41586-025-09833-y Olympiad-level formal mathematical reasoning with reinforcement learning]
+* 2025-12: [https://arxiv.org/abs/2512.14575 Extremal descendant integrals on moduli spaces of curves: An inequality discovered and proved in collaboration with AI]
+* [https://github.com/teorth/erdosproblems/wiki/AI-contributions-to-Erd%C5%91s-problems AI Solving Erdős Problems]:
+** 2026-01: [https://www.erdosproblems.com/728 Erdős Problem #728] and [https://www.erdosproblems.com/729 #729] solved by Aristotle using ChatGPT 5.2 Pro
+** 2026-01: [https://www.erdosproblems.com/forum/thread/397 Erdős Problem #397] [https://x.com/neelsomani/status/2010215162146607128?s=20 solved] by [https://neelsomani.com/ Neel Somani] using ChatGPT 5.2 Pro
+** 2026-01: [https://www.erdosproblems.com/205 Erdős Problem #205] solved by Aristotle using ChatGPT 5.2 Pro
+** 2026-01: [https://www.erdosproblems.com/forum/thread/281 Erdős Problem #281] [https://x.com/neelsomani/status/2012695714187325745?s=20 solved] by [https://neelsomani.com/ Neel Somani] using ChatGPT 5.2 Pro
+** 2026-01: Google DeepMind: [https://arxiv.org/abs/2601.21442 Irrationality of rapidly converging series: a problem of Erdős and Graham]
+*** [https://www.erdosproblems.com/1051 Erdős Problem #1051] [https://x.com/slow_developer/status/2018321002623901885?s=20 solved] by Google DeepMind Aletheia agent
+** 2026-01: Google DeepMind: [https://arxiv.org/abs/2601.22401 Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems]
+*** Attempted 700 problems, solved 13 open Erdős problems: 5 novel autonomous solutions, 8 through existing literature.
+** 2026-02: [https://www.erdosproblems.com/846 Erdős Problem #846]
+*** [https://x.com/roydanroy/status/2026804567178953048?s=20 Google DeepMind]
+*** [https://x.com/mehtaab_sawhney/status/2026716221933343147?s=20 Using OpenAI internal model] (paper: [https://cdn.openai.com/infinite-sets/main_single_clean3.pdf On infinite sets with no 3 on a line])
+** 2026-03: Three problems solved using OpenAI GPT internal model. Paper: [https://arxiv.org/pdf/2603.29961 Short Proofs in Combinatorics and Number Theory]
+** 2026-04: [https://www.erdosproblems.com/forum/thread/1196 Erdős Problem #1196] [https://x.com/Liam06972452/status/2044051379916882067?s=20 solved] by [https://x.com/Liam06972452 Leeham] using ChatGPT 5.4 Pro
+** 2026-04: [https://www.erdosproblems.com/forum/thread/258 Erdős Problem #258] [https://x.com/prz_chojecki/status/2044129595729854493?s=20 solved] by [https://x.com/prz_chojecki Przemek Chojecki] using ChatGPT 5.4 Pro
+* 2026-01: [https://arxiv.org/abs/2601.07222 The motivic class of the space of genus 0 maps to the flag variety]
+* 2026-02: Google DeepMind: [https://arxiv.org/abs/2602.10177 Towards Autonomous Mathematics Research]
+* 2026-03: Donald Knuth: [https://www-cs-faculty.stanford.edu/~knuth/papers/claude-cycles.pdf A problem in Directed Hamiltonian Cycles] solved by Filip Stappers using Claude Opus 4.6
+* 2026-03: Google DeepMind: [https://arxiv.org/abs/2603.09172 Reinforced Generation of Combinatorial Structures: Ramsey Numbers]
+* 2026-03: [https://epoch.ai/frontiermath/open-problems FrontierMath] problem: [https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs "A Ramsey-style Problem on Hypergraphs"] solved by Kevin Barreto and Liam Price using GPT-5.4 Pro
-* '''Math:'''
+==Physics assistance==
-** 2023-07: [https://www.nature.com/articles/s41586-023-06004-9?utm_source=chatgpt.com Faster sorting algorithms discovered using deep reinforcement learning]
+* 2025-03: [https://arxiv.org/abs/2503.23758 Exact solution of the frustrated Potts model with next-nearest-neighbor interactions in one dimension via AI bootstrapping] ([https://www.bnl.gov/staff/wyin Weiguo Yin])
-** 2025-11: [https://arxiv.org/abs/2511.02864 Mathematical exploration and discovery at scale]
+* 2025-12: [https://www.sciencedirect.com/science/article/pii/S0370269325008111 Relativistic covariance and nonlinear quantum mechanics: Tomonaga-Schwinger analysis]
-** 2025-11: [https://www.nature.com/articles/s41586-025-09833-y Olympiad-level formal mathematical reasoning with reinforcement learning]
+** [https://x.com/hsu_steve/status/1996034522308026435?s=20 Steve Hsu], [https://drive.google.com/file/d/16sxJuwsHoi-fvTFbri9Bu8B9bqA6lr1H/view Theoretical Physics with Generative AI]
-** 2025-12: [https://arxiv.org/abs/2512.14575 Extremal descendant integrals on moduli spaces of curves: An inequality discovered and proved in collaboration with AI]
+* 2026-02: [https://arxiv.org/abs/2602.12176 Single-minus gluon tree amplitudes are nonzero] (GPT-5.2, [https://openai.com/index/new-result-theoretical-physics/ blog])
-** [https://github.com/teorth/erdosproblems/wiki/AI-contributions-to-Erd%C5%91s-problems AI Solving Erdős Problems]:
-*** 2026-01: [https://www.erdosproblems.com/728 Erdős Problem #728] and [https://www.erdosproblems.com/729 #729] solved by Aristotle using ChatGPT 5.2 Pro
+==Literature exploration==
-*** 2026-01: [https://www.erdosproblems.com/forum/thread/397 Erdős Problem #397] [https://x.com/neelsomani/status/2010215162146607128?s=20 solved] by [https://neelsomani.com/ Neel Somani] using ChatGPT 5.2 Pro
+* 2025-11: [https://arxiv.org/abs/2511.02824 Kosmos: An AI Scientist for Autonomous Discovery] ([https://edisonscientific.com/ Edison])
-*** 2026-01: [https://www.erdosproblems.com/205 Erdős Problem #205] solved by Aristotle using ChatGPT 5.2 Pro
+** [https://platform.edisonscientific.com/kosmos/c4bdef64-5e9b-43b9-a365-592dd1ed7587 Nucleotide metabolism in hypothermia]
-*** 2026-01: [https://www.erdosproblems.com/forum/thread/281 Erdős Problem #281] [https://x.com/neelsomani/status/2012695714187325745?s=20 solved] by [https://neelsomani.com/ Neel Somani] using ChatGPT 5.2 Pro
+** [https://platform.edisonscientific.com/kosmos/1fdbf827-be65-4d97-9b66-bf0da600091a Determinant of perovskite solar-cell failure]
-** 2026-01: [https://arxiv.org/abs/2601.07222 The motivic class of the space of genus 0 maps to the flag variety]
+** [https://platform.edisonscientific.com/kosmos/4fb3fbdb-c449-4064-9aa6-ff4ec53131d8 Log-normal connectivity in neural networks]
-* '''Physics assistance:'''
+** [https://platform.edisonscientific.com/kosmos/c6849232-5858-4634-adf5-83780afbe3db SOD2 as driver of myocardial fibrosis]
-** 2025-03: [https://arxiv.org/abs/2503.23758 Exact solution of the frustrated Potts model with next-nearest-neighbor interactions in one dimension via AI bootstrapping] ([https://www.bnl.gov/staff/wyin Weiguo Yin])
+** [https://platform.edisonscientific.com/kosmos/abac07da-a6bb-458f-b0ba-ef08f1be617e Protective variant of SSR1 in type 2 diabetes]
-** 2025-12: [https://www.sciencedirect.com/science/article/pii/S0370269325008111 Relativistic covariance and nonlinear quantum mechanics: Tomonaga-Schwinger analysis]
+** [https://platform.edisonscientific.com/kosmos/a770052b-2334-4bbe-b086-5149e0f03d99 Temporal ordering in Alzheimer’s disease]
-*** [https://x.com/hsu_steve/status/1996034522308026435?s=20 Steve Hsu], [https://drive.google.com/file/d/16sxJuwsHoi-fvTFbri9Bu8B9bqA6lr1H/view Theoretical Physics with Generative AI]
+** [https://platform.edisonscientific.com/kosmos/28c427d2-be31-48b5-b272-28d5a1e3ea5c Mechanism of neuron vulnerability in aging]
-* '''Literature exploration:'''
+==Bio design==
-** 2025-11: [https://arxiv.org/abs/2511.02824 Kosmos: An AI Scientist for Autonomous Discovery] ([https://edisonscientific.com/ Edison])
+* 2023-07: [https://www.nature.com/articles/s41586-023-06415-8 De novo design of protein structure and function with RFdiffusion]
-*** [https://platform.edisonscientific.com/kosmos/c4bdef64-5e9b-43b9-a365-592dd1ed7587 Nucleotide metabolism in hypothermia]
+* 2025-11: [https://www.nature.com/articles/s41586-025-09721-5 Atomically accurate de novo design of antibodies with RFdiffusion]
-*** [https://platform.edisonscientific.com/kosmos/1fdbf827-be65-4d97-9b66-bf0da600091a Determinant of perovskite solar-cell failure]
+* 2025-11: [https://deepmind.google/blog/alphafold-five-years-of-impact/ AlphaFold: Five years of impact]
-*** [https://platform.edisonscientific.com/kosmos/4fb3fbdb-c449-4064-9aa6-ff4ec53131d8 Log-normal connectivity in neural networks]
+* 2026-01: [https://www.goodfire.ai/research/interpretability-for-alzheimers-detection# Using Interpretability to Identify a Novel Class of Alzheimer's Biomarkers]
-*** [https://platform.edisonscientific.com/kosmos/c6849232-5858-4634-adf5-83780afbe3db SOD2 as driver of myocardial fibrosis]
+==Material Discovery==
-*** [https://platform.edisonscientific.com/kosmos/abac07da-a6bb-458f-b0ba-ef08f1be617e Protective variant of SSR1 in type 2 diabetes]
+* 2023-11: [https://doi.org/10.1038/s41586-023-06735-9 Scaling deep learning for materials discovery]
-*** [https://platform.edisonscientific.com/kosmos/a770052b-2334-4bbe-b086-5149e0f03d99 Temporal ordering in Alzheimer’s disease]
-*** [https://platform.edisonscientific.com/kosmos/28c427d2-be31-48b5-b272-28d5a1e3ea5c Mechanism of neuron vulnerability in aging]
-* '''Bio design:'''
-** 2023-07: [https://www.nature.com/articles/s41586-023-06415-8 De novo design of protein structure and function with RFdiffusion]
-** 2025-11: [https://www.nature.com/articles/s41586-025-09721-5 Atomically accurate de novo design of antibodies with RFdiffusion]
-** 2025-11: [https://deepmind.google/blog/alphafold-five-years-of-impact/ AlphaFold: Five years of impact]
-* '''Material Discovery:'''
-** 2023-11: [https://doi.org/10.1038/s41586-023-06735-9 Scaling deep learning for materials discovery]
 =See Also=
 * [[AI agents]]
 * [https://nanobot.chat/ Nanobot.chat]: Intelligent AI for the labnetwork @ mtl.mit.edu forum

Difference between revisions of "Science Agents"

Latest revision as of 12:43, 16 April 2026

Contents

AI Use-cases for Science

Literature

LLM extract data from papers

AI finding links in literature

(Pre) Generate Articles

Explanation

Autonomous Ideation

Adapting LLMs to Science

AI/LLM Control of Scientific Instruments/Facilities

AI/ML Methods tailored to Science

Science Foundation Models

Regression (Data Fitting)

Tabular Classification/Regression

Symbolic Regression

Literature Discovery

Commercial

Bio

AI/ML Methods in Science

Imaging

Materials

Chemistry

Biology

Medicine

Successes

AI/ML Methods co-opted for Science

Mechanistic Interpretability

Uncertainty

Science Benchmarks

Science Agents

Reviews

Challenges

Specific

Science Multi-Agent Setups

Science Agentic Components

Frameworks

Personalities

Skills

AI Science Systems

Inorganic Materials Discovery

Materials Characterization

Chemistry

Bio

Physics

LLMs Optimized for Science

Impact of AI in Science

Related Tools

Literature Search

Data Visualization

Generative

Chemistry

Science Datasets

Genuine Discoveries

Math

Physics assistance

Literature exploration

Bio design

Material Discovery

See Also

Navigation menu

Search