Chapter 13: SARSA (On-Policy TD Control)

Learning objectives Implement SARSA: update \(Q(s,a)\) using the transition \((s,a,r,s',a')\) with target \(r + \gamma Q(s',a')\). Use \(\epsilon\)-greedy exploration for behavior and learn the same policy you follow (on-policy). Interpret learning curves (sum of rewards per episode) on Cliff Walking. Concept and real-world RL SARSA is an on-policy TD control method: it updates \(Q(s,a)\) using the actual next action \(a'\) chosen by the current policy, so it learns the value of the behavior policy (the one you are following). The update is \(Q(s,a) \leftarrow Q(s,a) + \alpha [r + \gamma Q(s',a') - Q(s,a)]\). Because \(a'\) can be exploratory, SARSA accounts for the risk of exploration (e.g. stepping off the cliff by accident) and often learns a safer policy than Q-learning on Cliff Walking. In real applications, on-policy methods are used when you want to optimize the same policy you use for data collection (e.g. safe robotics). ...
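The SARSA update and \(\epsilon\)-greedy behavior described above can be sketched as follows; this is a minimal illustration assuming a tabular \(Q\) stored in a dict keyed by `(state, action)` pairs (the function names and defaults are for illustration, not from the chapter's own code):

```python
import random
from collections import defaultdict

def epsilon_greedy(Q, state, actions, epsilon):
    """Behavior policy: explore with probability epsilon, else act greedily."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=0.99, terminal=False):
    """One on-policy TD step: the target uses a', the action actually
    chosen next by the same epsilon-greedy policy being followed."""
    target = r if terminal else r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])
```

Because the target contains \(Q(s',a')\) for the possibly exploratory \(a'\), cliff-adjacent states inherit some of the cost of accidental exploration, which is what pushes SARSA toward the safer path on Cliff Walking.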

March 10, 2026 · 3 min · 541 words · codefrydev

TD, SARSA, and Q-Learning in Code

Learning objectives Implement TD(0) prediction in code: update \(V(s)\) after each transition. Implement SARSA (on-policy TD control): update \(Q(s,a)\) using the next action from the behavior policy. Implement Q-learning (off-policy TD control): update \(Q(s,a)\) using the max over next actions. TD(0) prediction in code Goal: Estimate \(V^\pi\) for a fixed policy \(\pi\). Update: After each transition \((s, r, s')\): \(V(s) \leftarrow V(s) + \alpha \bigl[ r + \gamma V(s') - V(s) \bigr]\). Use \(V(s') = 0\) if \(s'\) is terminal. ...
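The two remaining updates named above, TD(0) prediction and the off-policy Q-learning target, can be sketched side by side. This is a hedged sketch assuming tabular values in defaultdicts, not the post's exact implementation:

```python
from collections import defaultdict

def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.99, terminal=False):
    """TD(0) prediction: bootstrap from V(s'), treating V(terminal) as 0."""
    target = r if terminal else r + gamma * V[s_next]
    V[s] += alpha * (target - V[s])

def q_learning_update(Q, actions, s, a, r, s_next,
                      alpha=0.1, gamma=0.99, terminal=False):
    """Q-learning: the target maximizes over next actions, regardless of
    which action the behavior policy will actually take (off-policy)."""
    target = r if terminal else r + gamma * max(Q[(s_next, b)] for b in actions)
    Q[(s, a)] += alpha * (target - Q[(s, a)])
```

The only structural difference from SARSA is the `max` in the target: Q-learning evaluates the greedy policy while following an exploratory one, whereas SARSA evaluates the policy it follows.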

March 10, 2026 · 2 min · 351 words · codefrydev

Chapter 16: N-Step Bootstrapping

Learning objectives Implement n-step SARSA: accumulate \(n\) steps of experience, then update \(Q(s_0,a_0)\) using the n-step return \(r_1 + \gamma r_2 + \cdots + \gamma^{n-1} r_n + \gamma^n Q(s_n,a_n)\). Compare n-step (\(n=4\)) with one-step SARSA on Cliff Walking (learning speed, stability). Understand the trade-off: n-step uses more information per update but delays the update. Concept and real-world RL N-step bootstrapping uses a return over \(n\) steps: \(G_{t:t+n} = r_{t+1} + \gamma r_{t+2} + \cdots + \gamma^{n-1} r_{t+n} + \gamma^n V(s_{t+n})\) (or \(Q(s_{t+n},a_{t+n})\) for SARSA). \(n=1\) is TD(0); \(n=\infty\) (until terminal) is Monte Carlo. Intermediate \(n\) balances bias and variance. In practice, n-step methods (e.g. n-step SARSA, A3C’s n-step returns) can learn faster than one-step when \(n\) is chosen well; too large \(n\) delays updates and can hurt in non-stationary or long episodes. ...
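The n-step return defined above is easy to compute by accumulating backwards from the bootstrap value; a minimal sketch (the function name and signature are illustrative, not from the chapter):

```python
def n_step_return(rewards, gamma, bootstrap=0.0):
    """Compute r_1 + gamma*r_2 + ... + gamma^(n-1)*r_n + gamma^n * bootstrap,
    where `rewards` is [r_1, ..., r_n] and `bootstrap` is V(s_n) or Q(s_n, a_n).
    Folding from the back avoids tracking powers of gamma explicitly."""
    G = bootstrap
    for r in reversed(rewards):
        G = r + gamma * G
    return G
```

With an empty reward list this reduces to the bootstrap value, and with `bootstrap=0.0` over a full episode it reduces to the Monte Carlo return, matching the \(n=1\) to \(n=\infty\) spectrum described above.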

March 10, 2026 · 3 min · 557 words · codefrydev

Phase 3 Foundations Quiz

Use this quiz after completing Volume 1 and Volume 2 (or the Phase 3 mini-project). If you can answer at least 12 of 15 correctly, you are ready for Phase 4 and Volume 3. 1. RL framework Q: Name the four main components of an RL system (agent, environment, and two more). What is a state? Answer Agent, environment, action, reward. State: a representation of the current situation the agent uses to choose actions. 2. Return Q: For rewards [0, 0, 1] and \(\gamma = 0.9\), compute the discounted return \(G_0\) from step 0. ...
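For quiz questions like the return computation in item 2, the discounted return \(G_0 = \sum_k \gamma^k r_{k+1}\) can be checked with a one-liner (a hypothetical helper, not part of the quiz itself):

```python
def discounted_return(rewards, gamma):
    """G_0 = r_1 + gamma*r_2 + gamma^2*r_3 + ... for rewards [r_1, r_2, ...]."""
    return sum((gamma ** k) * r for k, r in enumerate(rewards))

# For rewards [0, 0, 1] and gamma = 0.9: G_0 = 0.9^2 * 1, i.e. about 0.81.
```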

March 10, 2026 · 5 min · 876 words · codefrydev