Difference between revisions of "AI tutorials"
KevinYager (talk | contribs)
KevinYager (talk | contribs) (→LLM)
Line 18:
* [https://www.techrxiv.org/doi/full/10.36227/techrxiv.23589741.v1 A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage]
* [https://aman.ai/ Aman AI]: [https://aman.ai/primers/ai/LLM/ Overview of Large Language Models]
+ * 2025-01: [https://arxiv.org/abs/2501.09223 Foundations of Large Language Models]
==Other Visualizations==
* [https://pytorch.org/blog/inside-the-matrix/ Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond]
* [https://sohl-dickstein.github.io/2024/02/12/fractal.html Neural network training makes beautiful fractals]
Revision as of 14:11, 17 January 2025
General
Loss Functions
Transformer
- Wolfram: What Is ChatGPT Doing … and Why Does It Work?
- The Illustrated Transformer
- Transformers Explained Visually — Not Just How, but Why They Work So Well
Visualizations
LLM
- A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage
- Aman AI: Overview of Large Language Models
- 2025-01: Foundations of Large Language Models