Chapter 63: Curiosity-Driven Exploration (ICM)

Learning objectives:
- Implement the Intrinsic Curiosity Module (ICM): a forward model that predicts next-state features from the current state and action.
- Use the prediction error (between predicted and actual next-state features) as an intrinsic reward and combine it with A2C.
- Explain why prediction error encourages exploration in novel or stochastic parts of the state space.
- Compare exploration behavior (e.g. coverage, time to goal) with and without ICM on a sparse-reward maze.
- Relate curiosity-driven exploration to robot navigation and game AI, where rewards are sparse.

Concept and real-world RL ...
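The core of the forward-model objective can be sketched with a tiny numpy example. This is a minimal illustration under simplifying assumptions: hand-crafted state features and a linear predictor, where the full ICM learns a feature embedding jointly with an inverse-dynamics model. The class name `ForwardModel` and all hyperparameters here are illustrative, not from the chapter.

```python
import numpy as np

class ForwardModel:
    """Linear forward model: predicts next-state features from (phi(s), a).

    Illustrative sketch only; real ICM uses neural networks and learned
    feature embeddings trained alongside an inverse-dynamics model.
    """

    def __init__(self, feat_dim, n_actions, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        # Weights map concatenated [features, one-hot action] -> next features.
        self.W = rng.normal(scale=0.1, size=(feat_dim, feat_dim + n_actions))
        self.n_actions = n_actions
        self.lr = lr

    def _input(self, phi_s, a):
        one_hot = np.zeros(self.n_actions)
        one_hot[a] = 1.0
        return np.concatenate([phi_s, one_hot])

    def intrinsic_reward(self, phi_s, a, phi_next):
        # r_int = 1/2 * ||predicted features - actual features||^2
        pred = self.W @ self._input(phi_s, a)
        return 0.5 * np.sum((pred - phi_next) ** 2)

    def update(self, phi_s, a, phi_next):
        # One SGD step on the squared prediction error.
        x = self._input(phi_s, a)
        err = self.W @ x - phi_next
        self.W -= self.lr * np.outer(err, x)
        return 0.5 * np.sum(err ** 2)
```

The key dynamic: repeatedly visiting the same transition drives its prediction error, and hence its intrinsic reward, toward zero, so the agent is pushed toward transitions it cannot yet predict. In practice this reward is added (scaled) to the extrinsic reward before the A2C update.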

March 10, 2026 · 3 min · 624 words · codefrydev

Chapter 64: Random Network Distillation (RND)

Learning objectives:
- Implement RND: a fixed random target network and a predictor network that fits the target on visited states.
- Use the prediction error (target output vs. predictor output) as an intrinsic reward for exploration.
- Explain why RND rewards novelty without learning a forward model of the environment.
- Apply RND to a hard-exploration problem (e.g. Pitfall-style or sparse-reward maze) and compare with ε-greedy or count-based exploration.
- Relate RND to game AI and robot navigation, where state spaces are large and rewards are sparse.

Concept and real-world RL ...
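The target-vs-predictor mechanism can be sketched in a few lines of numpy. This is a simplified illustration under assumed details, linear "networks" and one-hot states, whereas the actual RND method uses deep networks and normalizes observations and rewards; the class name and hyperparameters are hypothetical.

```python
import numpy as np

class RND:
    """Random Network Distillation sketch with linear target/predictor maps."""

    def __init__(self, obs_dim, out_dim=8, lr=0.05, seed=0):
        rng = np.random.default_rng(seed)
        # Fixed random target network: initialized once, never trained.
        self.T = rng.normal(size=(out_dim, obs_dim))
        # Predictor network: trained to match the target on visited states.
        self.P = np.zeros((out_dim, obs_dim))
        self.lr = lr

    def intrinsic_reward(self, s):
        # Novelty signal: squared error between predictor and frozen target.
        return 0.5 * np.sum((self.P @ s - self.T @ s) ** 2)

    def update(self, s):
        # Fit the predictor to the target on a visited state (one SGD step).
        err = self.P @ s - self.T @ s
        self.P -= self.lr * np.outer(err, s)
        return 0.5 * np.sum(err ** 2)
```

Note that nothing here models environment dynamics: the predictor only distills a fixed random function, so its error is low exactly on states it has seen often and stays high on novel ones. That is why RND avoids the forward-model machinery of ICM while still rewarding novelty.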

March 10, 2026 · 3 min · 628 words · codefrydev