PER
Overall Progress
0%
Sum-tree prioritized buffer with TD error; importance-sampling weights.
Combine DDQN, Dueling, PER, NoisyNet, multi-step; train on Pong.
Sum-tree prioritized buffer with TD error; importance-sampling weights.
Combine DDQN, Dueling, PER, NoisyNet, multi-step; train on Pong.