Difference between revisions of "ERI"

From GISAXS
Jump to: navigation, search
(Agents)
(Agents)
 
(3 intermediate revisions by the same user not shown)
Line 22: Line 22:
 
#** Implementation:
 
#** Implementation:
 
#*** MI-SAE on both spaces, find matches (or maybe just "analogies"?)
 
#*** MI-SAE on both spaces, find matches (or maybe just "analogies"?)
 +
 +
'''Challenge: Connect reasoning models to domain models.'''
 +
# Latent space reasoning
 +
# Establish mappings (analogies) between interpretability spaces
 +
# Cycling recaptioning/reframing
 +
# Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model)
  
 
==Agents==
 
==Agents==
Line 33: Line 39:
 
## Autonomous ideation:
 
## Autonomous ideation:
 
##* '''Novel:''' Treat ideation as an AE problem in a semantic embedding space.
 
##* '''Novel:''' Treat ideation as an AE problem in a semantic embedding space.
 +
## Dynamic tree-of-thought: on-demand context generation, allows model to select among data representations (zoom, modality, etc.)
 
# Encode Human Patterns
 
# Encode Human Patterns
 
## Human scientist workflows (ideation, solving, etc.)
 
## Human scientist workflows (ideation, solving, etc.)
Line 68: Line 75:
 
===Human-Computer Interaction (HCI)===
 
===Human-Computer Interaction (HCI)===
 
'''What should the HCI be?'''
 
'''What should the HCI be?'''
 +
===Resources===
 +
# Need models, data, facilities, etc. all accessible as API endpoints.

Latest revision as of 12:00, 17 January 2025

ERI

Research Thrusts

Models

How to adapt frontier methods and foundation models to science?

Cognitive block11.png

  1. Topical fine-tuning
  2. Tool-use
  3. Advanced retrieval-augmented generation (RAG++)
    • Novel: Pre-generation: Agents continually add content to RAG corpus. ("Pre-thinking" across many vectors.)
  4. Science-adapted tokenization/embedding (xVal, [IDK])
  5. Specialized sampling
    • Entropy sampling: measure uncertainty of CoT trajectories
    • Novel: Handoff sampling:
      • Useful for:
        • text-to-text (specialization, creativity, etc.)
        • text-to-tool (e.g. math)
        • test-to-field (integrate non-textual FM)
      • Implementation:
        • MI-SAE on both spaces, find matches (or maybe just "analogies"?)

Challenge: Connect reasoning models to domain models.

  1. Latent space reasoning
  2. Establish mappings (analogies) between interpretability spaces
  3. Cycling recaptioning/reframing
  4. Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model)

Agents

How to make AI agents smarter?

Agent thinking01.png

  1. Iteration schemes (loops, blocks)
    1. Thinking:
      • Blocky/neural: Define architecture, allow system to pick hyper-parameters
    2. Autonomous ideation:
      • Novel: Treat ideation as an AE problem in a semantic embedding space.
    3. Dynamic tree-of-thought: on-demand context generation, allows model to select among data representations (zoom, modality, etc.)
  2. Encode Human Patterns
    1. Human scientist workflows (ideation, solving, etc.)
    2. Thought-templates, thought-flows
  3. How to allow agents to run for long time-horizons coherently?
    1. Basket of Metrics: Need to define metrics of: (1) research success, (2) uncertainty (entropy sampling?)
    2. Tool-use to "call human" and request help/information
  4. Memory
    1. Allow system to insert and retrieve from RAG at will.

Exocortex

What is the right architecture for AI swarms?

Coord schemes02.png

  1. Interaction schemes
    1. Test options, identify match between science task and scheme
    2. Treat interaction graph as ML optimization problem
    3. Novel: Map-spatial: Use a map (e.g. of BNL) to localize docs/resources/etc.
    4. Novel: Pseudo-spatial: Use position in embedding space to localize everything. Evolving state (velocity/momentum) of agent carries information.
    5. Novel: Dynamic-pseudo-spatial: Allow the space to be learned and updated; directions in embedding space can dictate information flow
  2. Establish benchmarks/challenges/validations

Infrastructure

Architecture

What software architecture is needed?

  1. Code for scaffolding
  2. Scheme for inter-agent messaging (plain English w/ pointers, etc.)
  3. Data management

Hardware

How to implement inference-time compute for exocortex?

  1. Heterogeneous hardware
  2. Elastic (combine local & cloud)
  3. Workflow management

Human-Computer Interaction (HCI)

What should the HCI be?

Resources

  1. Need models, data, facilities, etc. all accessible as API endpoints.