Phase 3 Foundations Quiz

Use this quiz after completing Volume 1 and Volume 2 (or the Phase 3 mini-project). If you can answer at least 12 of 15 correctly, you are ready for Phase 4 and Volume 3.

1. RL framework
Q: Name the four main components of an RL system (agent, environment, and two more). What is a state?
Answer: Agent, environment, action, reward. State: a representation of the current situation the agent uses to choose actions.

2. Return
Q: For rewards [0, 0, 1] and \(\gamma = 0.9\), compute the discounted return \(G_0\) from step 0. ...
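The return in question 2 can be checked mechanically: \(G_0 = \sum_t \gamma^t r_t = 0 + 0.9 \cdot 0 + 0.9^2 \cdot 1 = 0.81\). A minimal sketch of that sum (the helper name is ours, not from the quiz):

```python
def discounted_return(rewards, gamma):
    """G_0 = sum over t of gamma**t * rewards[t]."""
    g = 0.0
    for t, r in enumerate(rewards):
        g += (gamma ** t) * r
    return g

# For rewards [0, 0, 1] and gamma = 0.9 this gives 0.9**2 * 1, i.e. about 0.81.
print(discounted_return([0, 0, 1], 0.9))
```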

March 10, 2026 · 5 min · 876 words · codefrydev

Value Functions and Bellman Equation

This page covers the value functions and the Bellman equation you need for the preliminary assessment: the state-value function \(V^\pi(s)\), the action-value function \(Q^\pi(s,a)\), and the Bellman expectation equation for \(V^\pi\).

Why this matters for RL

Value functions are the expected return from a state (or state-action pair) under a policy. They are the main objects we estimate in value-based methods (e.g. TD, Q-learning) and appear in actor-critic methods as the critic. The Bellman equation is the recursive identity that connects the value of one state to the immediate reward and the values of its successor states; it is the basis of dynamic programming and TD learning. ...
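The Bellman expectation equation, \(V^\pi(s) = \sum_a \pi(a|s) \sum_{s'} P(s'|s,a)\,[r + \gamma V^\pi(s')]\), turns directly into iterative policy evaluation. The sketch below runs it on a made-up two-state MDP (the states, transitions, rewards, and uniform policy are illustrative assumptions, not from the article):

```python
gamma = 0.9

# Toy MDP (hypothetical): P[s][a] is a list of (prob, next_state, reward).
# Only the transition 0 --go--> 1 pays a reward.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 1.0)]},
    1: {"stay": [(1.0, 1, 0.0)], "go": [(1.0, 0, 0.0)]},
}
# Uniform random policy pi(a|s) = 0.5 for both actions in both states.
pi = {0: {"stay": 0.5, "go": 0.5}, 1: {"stay": 0.5, "go": 0.5}}

def policy_evaluation(P, pi, gamma, tol=1e-8):
    """Sweep the Bellman expectation backup until V stops changing."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            v = sum(
                pi[s][a] * sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])
                for a in P[s]
            )
            delta = max(delta, abs(v - V[s]))
            V[s] = v
        if delta < tol:
            return V

V = policy_evaluation(P, pi, gamma)
print(V)  # state 0 is worth more: it is one step from the only reward
```

Writing out the two fixed-point equations by hand gives \(V(0) = V(1) + 0.5\), which the iteration converges to; that kind of cross-check is a good exercise before the assessment.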

March 10, 2026 · 5 min · 906 words · codefrydev