Ten volumes, 100 chapters—each with an exercise to reinforce the material. Start with Volume 1: Mathematical Foundations and work through in order, or jump to a volume that matches your level.

  1. Volume 1: Mathematical Foundations — Chapters 1–10
  2. Volume 2: Tabular Methods & Classic Algorithms — Chapters 11–20
  3. Volume 3: Value Function Approximation & Deep Q-Learning — Chapters 21–30
  4. Volume 4: Policy Gradients — Chapters 31–40
  5. Volume 5: Advanced Policy Optimization — Chapters 41–50
  6. Volume 6: Model-Based RL & Planning — Chapters 51–60
  7. Volume 7: Exploration and Meta-Learning — Chapters 61–70
  8. Volume 8: Offline RL & Imitation Learning — Chapters 71–80
  9. Volume 9: Multi-Agent RL (MARL) — Chapters 81–90
  10. Volume 10: Real-World RL, Safety & Large Language Models — Chapters 91–100