Chapter 28: Prioritized Experience Replay (PER)

Learning objectives

- Implement prioritized replay: assign each transition a priority (e.g. the TD error \(|\delta|\)) and sample it with probability proportional to \(p_i^\alpha\).
- Use a sum tree (or a simpler alternative) for efficient sampling and priority updates.
- Apply importance-sampling weights \(w_i = (N \cdot P(i))^{-\beta} / \max_j w_j\) to correct the bias introduced by non-uniform sampling.

Concept and real-world RL

Prioritized Experience Replay (PER) samples transitions with probability proportional to their "priority", often the absolute TD error, so that surprising or informative transitions are replayed more often. This can speed up learning but introduces bias: the update distribution is no longer the uniform replay distribution. Importance-sampling weights correct for this by scaling each gradient update so that, in expectation, we recover the uniform case. A sum tree allows O(log N) sampling and priority updates. PER is used in Rainbow and other sample-efficient DQN variants. ...
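A minimal sketch of the two ingredients above: a sum tree for O(log N) proportional sampling, and the importance-sampling weight \(w_i = (N \cdot P(i))^{-\beta} / \max_j w_j\). The names `SumTree` and `importance_weights` are illustrative, not from a particular library; the sketch assumes the capacity is a power of two and that stored priorities are already \(p_i^\alpha\). Normalizing by the max weight within the sampled batch (rather than over the whole buffer) is a common simplification, also an assumption here.

```python
class SumTree:
    """Binary sum tree: leaf i stores priority p_i^alpha; each internal
    node stores the sum of its children, so the root is the total mass.
    Assumes `capacity` is a power of two (illustrative simplification)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = [0.0] * (2 * capacity)  # 1-indexed implicit heap

    def update(self, idx, priority):
        """Set leaf `idx` to `priority` and refresh ancestor sums: O(log N)."""
        i = idx + self.capacity
        self.tree[i] = priority
        i //= 2
        while i >= 1:
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]
            i //= 2

    def total(self):
        return self.tree[1]

    def sample(self, value):
        """Descend from the root to find the leaf whose prefix-sum interval
        contains `value`; caller draws `value` uniformly in [0, total())."""
        i = 1
        while i < self.capacity:
            left = 2 * i
            if value < self.tree[left]:
                i = left
            else:
                value -= self.tree[left]
                i = left + 1
        return i - self.capacity


def importance_weights(sampled_priorities, total_priority, buffer_size, beta):
    """w_i = (N * P(i))^{-beta}, normalized by the max weight in the batch."""
    probs = [p / total_priority for p in sampled_priorities]
    weights = [(buffer_size * p) ** (-beta) for p in probs]
    max_w = max(weights)
    return [w / max_w for w in weights]
```

With priorities 1.0 and 3.0 in a buffer of capacity 4, the second transition is sampled three times as often, and its IS weight is correspondingly smaller than 1.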

March 10, 2026 · 3 min · 633 words · codefrydev

Chapter 30: Rainbow DQN

Learning objectives

- Combine the Rainbow components: Double DQN, dueling architecture, prioritized replay, noisy networks, and optionally multi-step returns (and distributional RL).
- Train on a challenging environment (e.g. Pong or another Atari-style env) and compare with a baseline DQN.
- Understand which components contribute most to sample efficiency and stability.

Concept and real-world RL

Rainbow (Hessel et al.) combines several DQN improvements: Double DQN (reduces overestimation), dueling networks (separate value and advantage streams), PER (replays important transitions more often), noisy nets (state-dependent exploration), multi-step returns (n-step learning), and optionally C51 (distributional RL). Together they improve sample efficiency and final performance on Atari. In practice you do not need every component for every task: CartPole can be solved with vanilla DQN, while harder games benefit from the full stack. Implementing Rainbow is a capstone for the value-approximation volume. ...
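Two of the components above compose directly in the bootstrap target: the n-step return supplies the reward sum, and Double DQN decouples action selection (online net) from evaluation (target net). A minimal sketch, with illustrative function names and Q-values passed in as plain lists rather than network outputs:

```python
def n_step_return(rewards, gamma):
    """Discounted n-step reward sum: G = sum_{k=0}^{n-1} gamma^k * r_k."""
    g = 0.0
    for k, r in enumerate(rewards):
        g += (gamma ** k) * r
    return g


def double_dqn_target(n_step_g, gamma, n, q_online_next, q_target_next, done):
    """Rainbow-style n-step Double DQN target:
    G + gamma^n * Q_target(s_{t+n}, argmax_a Q_online(s_{t+n}, a)).
    The online net picks the action; the target net evaluates it."""
    if done:
        return n_step_g
    a_star = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return n_step_g + (gamma ** n) * q_target_next[a_star]
```

For example, with rewards [1, 1, 1] and gamma = 0.9 the 3-step return is 1 + 0.9 + 0.81 = 2.71; if the online net prefers action 1 at the bootstrap state, the target evaluates that action even when the target net would itself rank action 0 higher, which is the overestimation fix Double DQN contributes to the stack.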

March 10, 2026 · 3 min · 586 words · codefrydev