Curiosity

Overall Progress 0%

ICM: forward model, prediction error as intrinsic reward; A2C on maze.

Review Volume 6 (Model-Based RL, MCTS, Dyna-Q, world models) and preview Volume 7 (Exploration — intrinsic motivation, curiosity, and sparse rewards).