Phase 7
Overall Progress
0%
10โ12 questions on DQN, policy gradient, PPO, replay, target network. Solutions included.
Volumes 3โ5: value function approximation, DQN family, policy gradients, actor-critic, and advanced policy optimization (chapters 21โ50).