Difference between revisions of "AI tutorials"
KevinYager (talk | contribs) (Created page with "==General== * [https://mlu-explain.github.io/ MLU-EXPLAIN] ==Loss Functions== * [https://gombru.github.io/2018/05/23/cross_entropy_loss/ Understanding Categorical Cross-Entro...") |
KevinYager (talk | contribs) (→General) |
||
(12 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
==General== | ==General== | ||
* [https://mlu-explain.github.io/ MLU-EXPLAIN] | * [https://mlu-explain.github.io/ MLU-EXPLAIN] | ||
+ | * [https://arxiv.org/abs/2503.02113 Deep Learning is Not So Mysterious or Different] | ||
+ | * [https://theaidigest.org/ AI Digest]: Interactive AI explainers | ||
==Loss Functions== | ==Loss Functions== | ||
Line 11: | Line 13: | ||
===Visualizations=== | ===Visualizations=== | ||
* [https://bbycroft.net/llm LLM Visualization] | * [https://bbycroft.net/llm LLM Visualization] | ||
+ | * [https://poloclub.github.io/transformer-explainer/ Transformer Explainer] ([https://arxiv.org/abs/2408.04619 paper]) | ||
+ | * [https://moebio.com/mind/ Phrase completion] | ||
+ | * Karpathy: [https://colab.research.google.com/drive/1SVS-ALf9ToN6I6WmJno5RQkZEHFhaykJ#scrollTo=57wUOMOhaL2y Tiktoken Emoji] | ||
+ | |||
+ | ==LLM== | ||
+ | * [https://www.techrxiv.org/doi/full/10.36227/techrxiv.23589741.v1 A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage] | ||
+ | * [https://aman.ai/ Aman AI]: [https://aman.ai/primers/ai/LLM/ Overview of Large Language Models] | ||
+ | * [https://github.com/Hannibal046/Awesome-LLM Awesome-LLM]: Curated list of LLM projects | ||
+ | * [https://book.theaiedge.io/ The Big Book of Large Language Models] (Damien Benveniste) | ||
+ | * 2025-01: [https://arxiv.org/abs/2501.09223 Foundations of Large Language Models] | ||
+ | |||
+ | ===Video=== | ||
+ | * Andrej Karpathy: | ||
+ | ** [https://www.youtube.com/watch?v=EWvNQjAaOHw How I use LLMs] | ||
+ | ** [https://www.youtube.com/watch?v=7xTGNNLPyMI Deep Dive into LLMs like ChatGPT] | ||
==Other Visualizations== | ==Other Visualizations== | ||
+ | * [https://distill.pub/2019/visual-exploration-gaussian-processes/ A Visual Exploration of Gaussian Processes] | ||
* [https://pytorch.org/blog/inside-the-matrix/ Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond] | * [https://pytorch.org/blog/inside-the-matrix/ Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond] | ||
+ | * [https://sohl-dickstein.github.io/2024/02/12/fractal.html Neural network training makes beautiful fractals] |
Latest revision as of 15:00, 3 April 2025
Contents
General
- MLU-EXPLAIN
- Deep Learning is Not So Mysterious or Different
- AI Digest: Interactive AI explainers
Loss Functions
Transformer
- Wolfram: What Is ChatGPT Doing … and Why Does It Work?
- The Illustrated Transformer
- Transformers Explained Visually — Not Just How, but Why They Work So Well
Visualizations
LLM
- A Survey on Large Language Models: Applications, Challenges, Limitations, and Practical Usage
- Aman AI: Overview of Large Language Models
- Awesome-LLM: Curated list of LLM projects
- The Big Book of Large Language Models (Damien Benveniste)
- 2025-01: Foundations of Large Language Models
Video
- Andrej Karpathy: