Interactive · from first principles
internals
Interactive, first-principles tutorials for modern AI systems & system components.
01 Coming soon
Speculative Decoding
How language models generate several tokens per forward pass — without changing a single output.
LLM inference sampling
On the roadmap
Reinforcement learning Post-training Scaling laws Training infrastructure