Revision as of 12:00, 17 January 2025

ERI

Research Thrusts

Models

How to adapt frontier methods and foundation models to science?

Topical fine-tuning
Tool-use
Advanced retrieval-augmented generation (RAG++)
- Novel: Pre-generation: Agents continually add content to RAG corpus. ("Pre-thinking" across many vectors.)
Science-adapted tokenization/embedding (xVal, [IDK])
Specialized sampling
- Entropy sampling: measure uncertainty of CoT trajectories
- Novel: Handoff sampling:
  - Useful for:
    - text-to-text (specialization, creativity, etc.)
    - text-to-tool (e.g. math)
    - text-to-field (integrate non-textual FM)
  - Implementation:
    - MI-SAE on both spaces, find matches (or maybe just "analogies"?)
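
The entropy-sampling item above can be illustrated with a minimal sketch. It assumes each chain-of-thought (CoT) trajectory is available as a list of next-token probability distributions, and it uses mean per-token Shannon entropy as the uncertainty score. Both are assumptions for illustration, as are the function names; the notes do not specify a particular measure.

```python
import math


def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def trajectory_uncertainty(step_distributions):
    """Mean per-token entropy over one CoT trajectory (hypothetical score)."""
    if not step_distributions:
        return 0.0
    entropies = [token_entropy(p) for p in step_distributions]
    return sum(entropies) / len(entropies)


def most_uncertain(trajectories):
    """Index of the trajectory with the highest mean entropy."""
    scores = [trajectory_uncertainty(t) for t in trajectories]
    return max(range(len(scores)), key=scores.__getitem__)
```

A high score could, for example, mark the point where a trajectory is handed off to another model or tool, though that coupling is a guess rather than something stated here.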

Challenge: Connect reasoning models to domain models.

Latent space reasoning
Establish mappings (analogies) between interpretability spaces
Cycling recaptioning/reframing
Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model)

Agents

How to make AI agents smarter?

Iteration schemes (loops, blocks)
1. Thinking:
  - Blocky/neural: Define architecture, allow system to pick hyper-parameters
2. Autonomous ideation:
  - Novel: Treat ideation as an AE problem in a semantic embedding space.
3. Dynamic tree-of-thought: on-demand context generation, allows model to select among data representations (zoom, modality, etc.)
Encode Human Patterns
1. Human scientist workflows (ideation, solving, etc.)
2. Thought-templates, thought-flows
How to allow agents to run for long time-horizons coherently?
1. Basket of Metrics: Need to define metrics of: (1) research success, (2) uncertainty (entropy sampling?)
2. Tool-use to "call human" and request help/information
Memory
1. Allow system to insert and retrieve from RAG at will.
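
A minimal sketch of the memory item: a store that an agent could call as two tools, `insert` and `retrieve`. The class name and both method names are hypothetical. Bag-of-words cosine similarity stands in for a real embedding model so that the example stays self-contained.

```python
import math
import re
from collections import Counter


def _bow(text):
    """Lower-cased bag-of-words vector (stand-in for an embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def _cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class AgentMemory:
    """Hypothetical read/write corpus exposed to an agent as tools."""

    def __init__(self):
        self._docs = []  # list of (text, vector) pairs

    def insert(self, text):
        """Add a note to the corpus and return its index."""
        self._docs.append((text, _bow(text)))
        return len(self._docs) - 1

    def retrieve(self, query, k=3):
        """Return up to k stored notes, most similar to the query first."""
        q = _bow(query)
        ranked = sorted(self._docs, key=lambda d: _cosine(q, d[1]), reverse=True)
        return [text for text, _ in ranked[:k]]
```

Usage: the agent calls `insert(...)` whenever it wants to keep an intermediate result, and `retrieve(...)` when it needs context back.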

Exocortex

What is the right architecture for AI swarms?

Interaction schemes
1. Test options, identify match between science task and scheme
2. Treat interaction graph as ML optimization problem
3. Novel: Map-spatial: Use a map (e.g. of BNL) to localize docs/resources/etc.
4. Novel: Pseudo-spatial: Use position in embedding space to localize everything. Evolving state (velocity/momentum) of agent carries information.
5. Novel: Dynamic-pseudo-spatial: Allow the space to be learned and updated; directions in embedding space can dictate information flow
Establish benchmarks/challenges/validations
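
The pseudo-spatial scheme above can be sketched under some assumptions: each agent holds a position vector in a shared embedding space, its "velocity" is an exponential moving average of recent displacements, and messages go to the agent nearest a query point. The names `AgentState`, `move_to`, `nearest_agent`, and the smoothing factor `beta` are all hypothetical.

```python
import math


class AgentState:
    """Position of an agent in embedding space plus a smoothed velocity."""

    def __init__(self, position, beta=0.9):
        self.position = list(position)
        self.velocity = [0.0] * len(self.position)
        self.beta = beta  # weight given to the previous velocity estimate

    def move_to(self, new_position):
        """Update position; velocity becomes an EMA of the displacement."""
        step = [n - p for n, p in zip(new_position, self.position)]
        self.velocity = [
            self.beta * v + (1.0 - self.beta) * s
            for v, s in zip(self.velocity, step)
        ]
        self.position = list(new_position)


def nearest_agent(query, agents):
    """Name of the agent whose position is closest (Euclidean) to query."""
    return min(agents, key=lambda name: math.dist(query, agents[name].position))
```

In this sketch the velocity vector indicates the direction an agent's focus is drifting, which is one possible reading of "evolving state carries information".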

Infrastructure

Architecture

What software architecture is needed?

Code for scaffolding
Scheme for inter-agent messaging (plain English w/ pointers, etc.)
Data management
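
One possible shape for "plain English w/ pointers" messaging is sketched below. The message body stays free text, and large artifacts are referenced by name instead of being inlined. The `AgentMessage` class, its field names, and the JSON wire format are assumptions for illustration, not a settled design.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class AgentMessage:
    """Hypothetical inter-agent message: prose body plus named pointers."""

    sender: str
    recipient: str
    body: str  # plain-English content
    pointers: dict = field(default_factory=dict)  # label -> resource locator

    def to_json(self):
        """Serialize to a JSON string for transport."""
        return json.dumps(asdict(self), sort_keys=True)

    @classmethod
    def from_json(cls, raw):
        """Rebuild a message from its JSON form."""
        return cls(**json.loads(raw))
```

Keeping pointers in a separate field lets a receiving agent decide whether to dereference a resource at all, which keeps message context small.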

Hardware

How to implement inference-time compute for exocortex?

Heterogeneous hardware
Elastic (combine local & cloud)
Workflow management

Human-Computer Interaction (HCI)

What should the HCI be?

Resources

Need models, data, facilities, etc. all accessible as API endpoints.

@@ Line 16: / Line 16: @@
 #* Entropy sampling: measure uncertainty of CoT trajectories
 #* '''Novel:''' Handoff sampling:
-#** text-to-text (specialization, creativity, etc.)
+#** Useful for:
-#** text-to-tool (e.g. math)
+#*** text-to-text (specialization, creativity, etc.)
-#** test-to-field (integrate non-textual FM)
+#*** text-to-tool (e.g. math)
+#*** test-to-field (integrate non-textual FM)
+#** Implementation:
+#*** MI-SAE on both spaces, find matches (or maybe just "analogies"?)
+'''Challenge: Connect reasoning models to domain models.'''
+# Latent space reasoning
+# Establish mappings (analogies) between interpretability spaces
+# Cycling recaptioning/reframing
+# Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model)
 ==Agents==
@@ Line 26: / Line 35: @@
 # Iteration schemes (loops, blocks)
+## Thinking:
+##* Blocky/neural: Define architecture, allow system to pick hyper-parameters
 ## Autonomous ideation:
 ##* '''Novel:''' Treat ideation as an AE problem in a semantic embedding space.
+## Dynamic tree-of-thought: on-demand context generation, allows model to select among data representations (zoom, modality, etc.)
+# Encode Human Patterns
+## Human scientist workflows (ideation, solving, etc.)
+## Thought-templates, thought-flows
+# How to allow agents to run for long time-horizons coherently?
+## ''Basket of Metrics'': Need to define metrics of: (1) research success, (2) uncertainty (entropy sampling?)
+## Tool-use to "call human" and request help/information
 # Memory
-# Thought-templates, thought-flows
+## Allow system to insert and retrieve from RAG at will.
 ==Exocortex==
@@ Line 40: / Line 58: @@
 ## Treat interaction graph as ML optimization problem
 ## '''Novel:''' Map-spatial: Use a map (e.g. of BNL) to localize docs/resources/etc.
-## '''Novel:''' Pseudo-spatial: Use position in embedding space to localize everything
+## '''Novel:''' Pseudo-spatial: Use position in embedding space to localize everything. Evolving state (velocity/momentum) of agent carries information.
 ## '''Novel:''' Dynamic-pseudo-spatial: Allow the space to be learned and updated; directions in embedding space can dictate information flow
 # Establish benchmarks/challenges/validations
@@ Line 57: / Line 75: @@
 ===Human-Computer Interaction (HCI)===
 '''What should the HCI be?'''
+===Resources===
+# Need models, data, facilities, etc. all accessible as API endpoints.

Difference between revisions of "ERI"
