2 pages
Transformers
Transformers vs. Diffusion Models: Not All AI Is the Same
BeviaLLM: Building a Language Model From Scratch With Python and NumPy