Phase 6

Overall Progress 0%

10โ€“15 questions on MDPs, Bellman, MC vs TD, SARSA vs Q-learning. Solutions included.

Volumes 1โ€“2: MDPs, dynamic programming, Monte Carlo, TD, SARSA, and tabular Q-learning. Core theory before function approximation.