Ten volumes, 100 chapters—each with an exercise to reinforce the material. Start with Volume 1: Mathematical Foundations and work through in order, or jump to a volume that matches your level.
- Volume 1: Mathematical Foundations — Chapters 1–10
- Volume 2: Tabular Methods & Classic Algorithms — Chapters 11–20
- Volume 3: Value Function Approximation & Deep Q-Learning — Chapters 21–30
- Volume 4: Policy Gradients — Chapters 31–40
- Volume 5: Advanced Policy Optimization — Chapters 41–50
- Volume 6: Model-Based RL & Planning — Chapters 51–60
- Volume 7: Exploration and Meta-Learning — Chapters 61–70
- Volume 8: Offline RL & Imitation Learning — Chapters 71–80
- Volume 9: Multi-Agent RL (MARL) — Chapters 81–90
- Volume 10: Real-World RL, Safety & Large Language Models — Chapters 91–100