Difference between revisions of "AI tutorials"
KevinYager (talk | contribs) (Created page with "==General== * [https://mlu-explain.github.io/ MLU-EXPLAIN] ==Loss Functions== * [https://gombru.github.io/2018/05/23/cross_entropy_loss/ Understanding Categorical Cross-Entro...") |
KevinYager (talk | contribs) (→Visualizations) |
||
(One intermediate revision by the same user not shown) | |||
Line 11: | Line 11: | ||
===Visualizations=== | ===Visualizations=== | ||
* [https://bbycroft.net/llm LLM Visualization] | * [https://bbycroft.net/llm LLM Visualization] | ||
+ | * [https://poloclub.github.io/transformer-explainer/ Transformer Explainer] ([https://arxiv.org/abs/2408.04619 paper]) | ||
+ | * [https://moebio.com/mind/ Phrase completion] | ||
+ | * Karpathy: [https://colab.research.google.com/drive/1SVS-ALf9ToN6I6WmJno5RQkZEHFhaykJ#scrollTo=57wUOMOhaL2y Tiktoken Emoji] | ||
==Other Visualizations== | ==Other Visualizations== | ||
* [https://pytorch.org/blog/inside-the-matrix/ Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond] | * [https://pytorch.org/blog/inside-the-matrix/ Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond] | ||
+ | * [https://sohl-dickstein.github.io/2024/02/12/fractal.html Neural network training makes beautiful fractals] |
Latest revision as of 15:26, 9 December 2024
General
Loss Functions
Transformer
- Wolfram: What Is ChatGPT Doing … and Why Does It Work?
- The Illustrated Transformer
- Transformers Explained Visually — Not Just How, but Why They Work So Well