Interactive · from first principles

internals

Interactive, first-principles tutorials for modern AI systems & system components.

01 Coming soon

Speculative Decoding

How language models generate several tokens per forward pass — without changing a single output.

LLM inference sampling

On the roadmap

Reinforcement learning Post-training Scaling laws Training infrastructure