This track covers the math you need to read and do reinforcement learning: probability & statistics, linear algebra, and calculus. Each topic is tied to how it appears in RL (bandits, value functions, gradients). Each topic page includes practice questions with full step-by-step solutions (collapsible “Answer and explanation”) so you can check your work and see the derivation. Work through the pages in order, or use them to fill gaps after the Preliminary assessment.

Recommended order: Probability & statisticsLinear algebraCalculus.


Why this math matters in RL


TopicContentRL use
Probability & statisticsExpectations, variance, sample mean, distributions, law of large numbersBandit rewards, MC returns, policy evaluation
Linear algebraVectors, dot product, matrices, gradientsState vectors, value parameterization, gradient updates
CalculusDerivatives, chain rule, partial derivativesPolicy gradient, loss gradients, backprop

After finishing this track, take the Phase 1 self-check (10 questions). If you pass, you are ready for Phase 2 and Volume 1.