Difference between revisions of "ERI"

From GISAXS
Jump to: navigation, search
(Agents)
(Exocortex)
Line 34: Line 34:
 
==Exocortex==
 
==Exocortex==
 
'''What is the right architecture for AI swarms?'''
 
'''What is the right architecture for AI swarms?'''
 +
 +
[[Image:Coord schemes02.png|400px]]
 +
 
# Interaction schemes
 
# Interaction schemes
 
## Test options, identify match between science task and scheme
 
## Test options, identify match between science task and scheme

Revision as of 14:11, 13 January 2025

ERI

Research Thrusts

Models

How to adapt frontier methods and foundation models to science?

Cognitive block11.png

  1. Topical fine-tuning
  2. Tool-use
  3. Advanced retrieval-augmented generation (RAG++)
    • Novel: Pre-generation: Agents continually add content to RAG corpus. ("Pre-thinking" across many vectors.)
  4. Science-adapted tokenization/embedding (xVal, [IDK])
  5. Specialized sampling
    • Entropy sampling: measure uncertainty of CoT trajectories
    • Novel: Handoff sampling:
      • text-to-text (specialization, creativity, etc.)
      • text-to-tool (e.g. math)
      • test-to-field (integrate non-textual FM)

Agents

How to make AI agents smarter?

Agent thinking01.png

  1. Iteration schemes (loops, blocks)
    1. Autonomous ideation:
      • Novel: Treat ideation as an AE problem in a semantic embedding space.
  1. Memory
  2. Thought-templates, thought-flows

Exocortex

What is the right architecture for AI swarms?

Coord schemes02.png

  1. Interaction schemes
    1. Test options, identify match between science task and scheme
    2. Treat interaction graph as ML optimization problem
    3. Novel: Map-spatial: Use a map (e.g. of BNL) to localize docs/resources/etc.
    4. Novel: Pseudo-spatial: Use position in embedding space to localize everything
    5. Novel: Dynamic-pseudo-spatial: Allow the space to be learned and updated; directions in embedding space can dictate information flow
  2. Establish benchmarks/challenges/validations

Infrastructure

Architecture

What software architecture is needed?

  1. Code for scaffolding
  2. Scheme for inter-agent messaging (plain English w/ pointers, etc.)
  3. Data management

Hardware

How to implement inference-time compute for exocortex?

  1. Heterogeneous hardware
  2. Elastic (combine local & cloud)
  3. Workflow management

Human-Computer Interaction (HCI)

What should the HCI be?