Difference between revisions of "AI research trends"

From GISAXS
Jump to: navigation, search
(Diffusion Language Models)
(Novel Tokenization and/or Sampling)
Line 1: Line 1:
=Novel Tokenization and/or Sampling=
 
* 2024-04: [https://arxiv.org/abs/2404.19737 Better & Faster Large Language Models via Multi-token Prediction]
 
* 2024-10: [https://github.com/xjdr-alt/entropix entropix: Entropy Based Sampling and Parallel CoT Decoding]
 
* 2024-10: [https://arxiv.org/abs/2410.01104 softmax is not enough (for sharp out-of-distribution)]
 
* 2024-12: [https://arxiv.org/abs/2412.06676 I Don't Know: Explicit Modeling of Uncertainty with an <nowiki>[IDK]</nowiki> Token]
 
 
 
=System 2 Reasoning=
 
=System 2 Reasoning=
 
See: [[Increasing AI Intelligence]]
 
See: [[Increasing AI Intelligence]]

Revision as of 09:16, 2 April 2025

System 2 Reasoning

See: Increasing AI Intelligence

Memory

LLM Weights Memory

Context Length

Extended Context

Retrieval beyond RAG

See also: AI tools: Retrieval Augmented Generation (RAG)

Working Memory

Episodic Memory

Updating Weights at Inference-time

Parameters as Tokens

Internal Thought Representation Space

Visual Thinking

Neural (non-token) Latent Representation

Generation Strategies

Generation Order

Diffusion Language Models

Related: Image Synthesis via Autoregression/Diffusion

See Also