Review
Review deep learning and see why RL needs neural networks – the bridge to DQN and policy gradients.
Review ML Foundations and see why linear models fail on complex patterns – motivation for neural networks.
Review Volume 1 concepts and preview Volume 2. From dynamic programming (which assumes a known model) to model-free methods.
Review Volume 2 tabular methods and preview Volume 3. From Q-tables to neural network function approximation.
Review Volume 3 (DQN and variants) and preview Volume 4 (Policy Gradients). From value-based to policy-based methods.
Review Volume 4 (Policy Gradients, Actor-Critic, DDPG, TD3) and preview Volume 5 (PPO, TRPO, SAC – stable, scalable policy optimization).
Review Volume 5 (PPO, TRPO, SAC) and preview Volume 6 (Model-Based RL – learning world models and planning).
Review Volume 6 (Model-Based RL, MCTS, Dyna-Q, world models) and preview Volume 7 (Exploration – intrinsic motivation, curiosity, and sparse rewards).
Review Volume 7 (Exploration, ICM, RND, Go-Explore, Meta-RL) and preview Volume 8 (Offline RL, Imitation Learning, RLHF).
Review Volume 8 (Offline RL, Imitation Learning, IRL, RLHF) and preview Volume 9 (Multi-Agent RL – cooperation, competition, game theory).
Review Volume 9 (Multi-Agent RL, game theory, QMIX, MAPPO) and preview Volume 10 (Real-World RL – safety, alignment, LLMs, deployment).