Ten volumes, 100 chapters, each with an exercise to reinforce the material. Start with Volume 1: Mathematical Foundations and work through in order, or jump to a volume that matches your level.
- Volume 1: Mathematical Foundations (Chapters 1–10)
- Volume 2: Tabular Methods & Classic Algorithms (Chapters 11–20)
- Volume 3: Value Function Approximation & Deep Q-Learning (Chapters 21–30)
- Volume 4: Policy Gradients (Chapters 31–40)
- Volume 5: Advanced Policy Optimization (Chapters 41–50)
- Volume 6: Model-Based RL & Planning (Chapters 51–60)
- Volume 7: Exploration and Meta-Learning (Chapters 61–70)
- Volume 8: Offline RL & Imitation Learning (Chapters 71–80)
- Volume 9: Multi-Agent RL (MARL) (Chapters 81–90)
- Volume 10: Real-World RL, Safety & Large Language Models (Chapters 91–100)