Ten volumes, 100 chapters, each with an exercise to reinforce the material. Start with Volume 1: Mathematical Foundations and work through in order, or jump to a volume that matches your level.

  1. Volume 1: Mathematical Foundations (Chapters 1–10)
  2. Volume 2: Tabular Methods & Classic Algorithms (Chapters 11–20)
  3. Volume 3: Value Function Approximation & Deep Q-Learning (Chapters 21–30)
  4. Volume 4: Policy Gradients (Chapters 31–40)
  5. Volume 5: Advanced Policy Optimization (Chapters 41–50)
  6. Volume 6: Model-Based RL & Planning (Chapters 51–60)
  7. Volume 7: Exploration and Meta-Learning (Chapters 61–70)
  8. Volume 8: Offline RL & Imitation Learning (Chapters 71–80)
  9. Volume 9: Multi-Agent RL (MARL) (Chapters 81–90)
  10. Volume 10: Real-World RL, Safety & Large Language Models (Chapters 91–100)