Difference between revisions of "ERI"
KevinYager (talk | contribs) (→Models) |
KevinYager (talk | contribs) (→Models) |
||
Line 27: | Line 27: | ||
# Establish mappings (analogies) between interpretability spaces | # Establish mappings (analogies) between interpretability spaces | ||
# Cycling recaptioning/reframing | # Cycling recaptioning/reframing | ||
− | # Tokenizer-for-science: learn right spectrum of representations | + | # Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model) |
==Agents== | ==Agents== |
Revision as of 11:58, 17 January 2025
ERI
Contents
Research Thrusts
Models
How to adapt frontier methods and foundation models to science?
- Topical fine-tuning
- Tool-use
- Advanced retrieval-augmented generation (RAG++)
- Novel: Pre-generation: Agents continually add content to RAG corpus. ("Pre-thinking" across many vectors.)
- Science-adapted tokenization/embedding (xVal, [IDK])
- Specialized sampling
- Entropy sampling: measure uncertainty of CoT trajectories
- Novel: Handoff sampling:
- Useful for:
- text-to-text (specialization, creativity, etc.)
- text-to-tool (e.g. math)
- test-to-field (integrate non-textual FM)
- Implementation:
- MI-SAE on both spaces, find matches (or maybe just "analogies"?)
- Useful for:
Challenge: Connect reasoning models to domain models.
- Latent space reasoning
- Establish mappings (analogies) between interpretability spaces
- Cycling recaptioning/reframing
- Tokenizer-for-science: learn right spectrum of representations (for text/image reasoning model)
Agents
How to make AI agents smarter?
- Iteration schemes (loops, blocks)
- Thinking:
- Blocky/neural: Define architecture, allow system to pick hyper-parameters
- Autonomous ideation:
- Novel: Treat ideation as an AE problem in a semantic embedding space.
- Thinking:
- Encode Human Patterns
- Human scientist workflows (ideation, solving, etc.)
- Thought-templates, thought-flows
- How to allow agents to run for long time-horizons coherently?
- Basket of Metrics: Need to define metrics of: (1) research success, (2) uncertainty (entropy sampling?)
- Tool-use to "call human" and request help/information
- Memory
- Allow system to insert and retrieve from RAG at will.
Exocortex
What is the right architecture for AI swarms?
- Interaction schemes
- Test options, identify match between science task and scheme
- Treat interaction graph as ML optimization problem
- Novel: Map-spatial: Use a map (e.g. of BNL) to localize docs/resources/etc.
- Novel: Pseudo-spatial: Use position in embedding space to localize everything. Evolving state (velocity/momentum) of agent carries information.
- Novel: Dynamic-pseudo-spatial: Allow the space to be learned and updated; directions in embedding space can dictate information flow
- Establish benchmarks/challenges/validations
Infrastructure
Architecture
What software architecture is needed?
- Code for scaffolding
- Scheme for inter-agent messaging (plain English w/ pointers, etc.)
- Data management
Hardware
How to implement inference-time compute for exocortex?
- Heterogeneous hardware
- Elastic (combine local & cloud)
- Workflow management
Human-Computer Interaction (HCI)
What should the HCI be?
Resources
- Need models, data, facilities, etc. all accessible as API endpoints.