Difference between revisions of "AI tutorials"
KevinYager (talk | contribs) (→LLM) |
KevinYager (talk | contribs) (→LLM) |
||
Line 19: | Line 19: | ||
==LLM== | ==LLM== | ||
+ | * [https://transformer-circuits.pub/2022/toy_model/index.html Toy Models of Superposition] | ||
* [https://www.techrxiv.org/doi/full/10.36227/techrxiv.23589741.v1 A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage] | * [https://www.techrxiv.org/doi/full/10.36227/techrxiv.23589741.v1 A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage] | ||
* [https://aman.ai/ Aman AI]: [https://aman.ai/primers/ai/LLM/ Overview of Large Language Models] | * [https://aman.ai/ Aman AI]: [https://aman.ai/primers/ai/LLM/ Overview of Large Language Models] |
Latest revision as of 13:54, 13 April 2025
Contents
General
- MLU-EXPLAIN
- Deep Learning is Not So Mysterious or Different
- AI Digest: Interactive AI explainers
- OpenAI Academy
Loss Functions
Transformer
- Wolfram: What Is ChatGPT Doing … and Why Does It Work?
- The Illustrated Transformer
- Transformers Explained Visually — Not Just How, but Why They Work So Well
Visualizations
LLM
- Toy Models of Superposition
- A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage
- Aman AI: Overview of Large Language Models
- Awesome-LLM: Curated list of LLM projects
- The Big Book of Large Language Models (Damien Benveniste)
- 2025-01: Foundations of Large Language Models
Video
- Andrej Karpathy:
Prompt Engineering
- 2025-04: Lee Boonstra (Google): Prompt Engineering