Phase 6

Overall Progress 0%

10–15 questions on MDPs, Bellman, MC vs TD, SARSA vs Q-learning. Solutions included.

Volumes 1–2: MDPs, dynamic programming, Monte Carlo, TD, SARSA, and tabular Q-learning. Core theory before function approximation.