Difference between revisions of "Science Agents"

From GISAXS
(Autonomous Ideation)
(Mechanistic Interpretability)
 
(7 intermediate revisions by the same user not shown)
Line 10: Line 10:
 
* 2024-09: [https://arxiv.org/abs/2409.14202 Mining Causality: AI-Assisted Search for Instrumental Variables]
 
 
* 2024-12: [https://arxiv.org/abs/2412.07977 Thinking Fast and Laterally: Multi-Agentic Approach for Reasoning about Uncertain Emerging Events]
 
+ * 2024-12: [https://arxiv.org/abs/2412.14141 LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research]
  
 
==Adapting LLMs to Science==
 
Line 17: Line 18:
  
 
==AI/ML Methods tailored to Science==
 
+ ===Regression (Data Fitting)===
+ * 2024-06: [https://arxiv.org/abs/2406.14546 Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data]: training on (x,y) pairs alone enables the model to infer the underlying function (it can then define it in code, invert it, or compose it)
+ * 2024-02: [https://arxiv.org/abs/2402.14547 OmniPred: Language Models as Universal Regressors]
 +
 
===Symbolic Regression===
 
 
* 2024-09: [https://arxiv.org/abs/2409.09359 Symbolic Regression with a Learned Concept Library]
 
Line 35: Line 40:
 
* Mechanistic interpretability for protein language models ([https://interprot.com/ visualizer], [https://github.com/etowahadams/interprot/tree/main code], [https://huggingface.co/liambai/InterProt-ESM2-SAEs SAE])
 
 
* [https://www.markov.bio/ Markov Bio]: [https://www.markov.bio/research/mech-interp-path-to-e2e-biology Through a Glass Darkly: Mechanistic Interpretability as the Bridge to End-to-End Biology] ([https://x.com/adamlewisgreen/status/1853206279499751531 quick description], [https://markovbio.github.io/biomedical-progress/ background info on recent bio progress])
 
+ * 2023-01: [https://arxiv.org/abs/2301.05062 Tracr: Compiled Transformers as a Laboratory for Interpretability] ([https://github.com/google-deepmind/tracr code])
+ * 2024-12: [https://www.arxiv.org/abs/2412.16247 Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models]
+ * 2024-12: [https://arxiv.org/abs/2412.12101 InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders]
+ * 2025-01: [https://arxiv.org/abs/2501.00089 Insights on Galaxy Evolution from Interpretable Sparse Feature Networks]
  
 
===Uncertainty===
 
Line 50: Line 59:
 
* 2024-12-06: [https://www.biorxiv.org/content/10.1101/2024.11.11.623004v1 The Virtual Lab: AI Agents Design New SARS-CoV-2 Nanobodies with Experimental Validation] (writeup: [https://www.nature.com/articles/d41586-024-01684-3 Virtual lab powered by ‘AI scientists’ super-charges biomedical research: Could human–AI collaborations be the future of interdisciplinary studies?])
 
 
* 2024-12-11: Google [https://blog.google/products/gemini/google-gemini-deep-research/ Deep Research]
 
+ * 2024-12-30: [https://arxiv.org/abs/2412.21154 Aviary: training language agents on challenging scientific tasks]
  
 
=AI Science Systems=
 

Latest revision as of 09:43, 4 January 2025

AI Use-cases for Science

Literature

AI finding links in literature

Autonomous Ideation

Adapting LLMs to Science

AI/ML Methods tailored to Science

Regression (Data Fitting)

Symbolic Regression

Literature Discovery

Commercial

AI/ML Methods co-opted for Science

Mechanistic Interpretability

Train a large model on scientific data, then apply mechanistic interpretability methods (e.g. sparse autoencoders, SAEs) to its feature/activation space.
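
As a rough illustration of the sparse autoencoder step, the sketch below encodes one activation vector into non-negative features, reconstructs it, and scores the result with a reconstruction term plus an L1 sparsity penalty. It is a minimal sketch under stated assumptions, not the method of any paper listed here: the names (<code>sae_forward</code>, <code>sae_loss</code>, <code>l1_coeff</code>), the tiny dimensions, and the hand-picked weights are all hypothetical. In practice the weights would be learned by gradient descent on activations recorded from the trained science model.

```python
# Minimal sparse autoencoder (SAE) forward pass and loss, standard library only.
# All names, sizes, and weights are illustrative assumptions.

def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """Encode activation x into ReLU features, then linearly reconstruct x."""
    pre = [p + b for p, b in zip(matvec(W_enc, x), b_enc)]
    features = [max(0.0, p) for p in pre]  # ReLU keeps features non-negative
    recon = [r + b for r, b in zip(matvec(W_dec, features), b_dec)]
    return features, recon

def sae_loss(x, features, recon, l1_coeff=1e-3):
    """Mean squared reconstruction error plus an L1 penalty on the features."""
    mse = sum((a - b) ** 2 for a, b in zip(x, recon)) / len(x)
    sparsity = sum(abs(f) for f in features)
    return mse + l1_coeff * sparsity

# Hypothetical 2-dimensional activation and an overcomplete 4-feature dictionary
# whose features are the +x, -x, +y, -y directions.
W_enc = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
b_enc = [0.0, 0.0, 0.0, 0.0]
W_dec = [[1.0, -1.0, 0.0, 0.0], [0.0, 0.0, 1.0, -1.0]]
b_dec = [0.0, 0.0]

x = [0.5, -2.0]
features, recon = sae_forward(x, W_enc, b_enc, W_dec, b_dec)
loss = sae_loss(x, features, recon)
print(features, recon, loss)
```

With this hand-built dictionary only two of the four features are active for the example input, which is the property that makes individual features candidates for inspection: each active feature can be examined against the scientific inputs that trigger it.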

Uncertainty

Science Agents

AI Science Systems

Inorganic Materials Discovery

Chemistry

Impact of AI in Science

Related Tools

Data Visualization

See Also