Chapter 17: Planning and Learning with Tabular Methods

Learning objectives

- Implement a simple model: store \((s,a) \rightarrow (r, s')\) from experience.
- Implement Dyna-Q: after each real environment step, perform \(k\) extra Q-updates using random \((s,a)\) pairs drawn from the model (simulated experience).
- Compare sample efficiency: Dyna-Q (planning + learning) vs. Q-learning (learning only).

Concept and real-world RL

Model-based methods use a learned or given model of the environment (transitions and rewards). Dyna-Q learns a tabular model from real experience: when you observe \((s,a,r,s')\), store it. Then, in addition to updating \(Q(s,a)\) from the real transition, you replay random \((s,a)\) pairs from the model, look up \((r,s')\), and perform a Q-learning update. This yields more learning per real step (planning). In real applications, learned models are used in model-based RL (e.g. world models, MuZero) to reduce sample complexity; the key idea is reusing past experience for extra updates. ...
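The Dyna-Q loop above can be sketched in a few dozen lines. The corridor environment, state/action encoding, and hyperparameters here are illustrative assumptions, not part of the chapter; the structure (real update, model store, \(k\) planning updates) follows the text:

```python
import random
from collections import defaultdict

# Hypothetical environment for illustration: a 1-D corridor of states 0..5,
# actions 0 (left) / 1 (right), reward 1.0 only on reaching the goal state.
N_STATES, GOAL = 6, 5
ACTIONS = [0, 1]

def step(s, a):
    """Deterministic corridor dynamics (an assumption for this sketch)."""
    s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
    return (1.0 if s2 == GOAL else 0.0), s2, s2 == GOAL

def dyna_q(episodes=50, k=10, alpha=0.1, gamma=0.95, eps=0.1, seed=0):
    rng = random.Random(seed)
    Q = defaultdict(float)   # Q[(s, a)]
    model = {}               # model[(s, a)] = (r, s2, done) -- the tabular model

    def act(s):
        # epsilon-greedy with random tie-breaking
        if rng.random() < eps:
            return rng.choice(ACTIONS)
        qs = [Q[(s, a)] for a in ACTIONS]
        best = max(qs)
        return rng.choice([a for a in ACTIONS if qs[a] == best])

    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = act(s)
            r, s2, done = step(s, a)
            # (1) direct RL: Q-learning update from the real transition
            target = r + (0.0 if done else gamma * max(Q[(s2, b)] for b in ACTIONS))
            Q[(s, a)] += alpha * (target - Q[(s, a)])
            # (2) model learning: store the observed transition
            model[(s, a)] = (r, s2, done)
            # (3) planning: k extra updates from randomly replayed experience
            for _ in range(k):
                (ps, pa), (pr, ps2, pdone) = rng.choice(list(model.items()))
                ptarget = pr + (0.0 if pdone else gamma * max(Q[(ps2, b)] for b in ACTIONS))
                Q[(ps, pa)] += alpha * (ptarget - Q[(ps, pa)])
            s = s2
    return Q

Q = dyna_q()
```

Setting `k=0` recovers plain one-step Q-learning, so the sample-efficiency comparison in the objectives is a one-argument change.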

March 10, 2026 · 3 min · 583 words · codefrydev

Chapter 53: Planning with Known Models

Learning objectives

- Implement a planner using breadth-first search (BFS) for a gridworld with known deterministic dynamics.
- Recover the optimal policy (path to goal) and compare with dynamic programming (value iteration) in terms of computation and result.
- Relate BFS to shortest-path planning in robot navigation.

Concept and real-world RL

When the model is known and deterministic, we can plan without learning: BFS finds the shortest path from start to goal; value iteration computes optimal values for all states. In robot navigation (grid or graph), BFS is used for pathfinding; DP is used when we need values everywhere (e.g. for reward shaping). Both assume the model is correct; in RL we often learn the model or the value function from data. ...
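A minimal BFS planner along these lines might look as follows; the grid layout, start, and goal are assumptions made for the example (`#` marks a wall, `.` a free cell):

```python
from collections import deque

# Hypothetical 5x5 gridworld with known deterministic dynamics.
GRID = [
    "....#",
    ".##.#",
    ".#...",
    ".#.#.",
    "...#.",
]

def bfs_plan(grid, start, goal):
    """Return a shortest start->goal path as a list of (row, col) cells, or None."""
    rows, cols = len(grid), len(grid[0])
    parent = {start: None}          # also serves as the visited set
    frontier = deque([start])
    while frontier:
        cell = frontier.popleft()
        if cell == goal:
            # reconstruct the path by following parent pointers back to start
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] != "#" and (nr, nc) not in parent:
                parent[(nr, nc)] = cell
                frontier.append((nr, nc))
    return None  # goal unreachable

path = bfs_plan(GRID, (0, 0), (4, 4))
```

BFS touches each reachable state once and returns a single shortest path; value iteration sweeps all states repeatedly but yields optimal values everywhere, which is what you want when the values themselves are needed (e.g. for reward shaping).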

March 10, 2026 · 3 min · 443 words · codefrydev

Chapter 54: Monte Carlo Tree Search (MCTS)

Learning objectives

- Implement MCTS for a small game (e.g. tic-tac-toe): selection (UCT), expansion, simulation (rollout), backpropagation.
- Use UCT (Upper Confidence bounds applied to Trees) for node selection: \(\frac{Q(s,a)}{N(s,a)} + c \sqrt{\frac{\log N(s)}{N(s,a)}}\).
- Evaluate win rate against a random opponent.

Concept and real-world RL

MCTS builds a search tree by repeatedly selecting a leaf (UCT), expanding it, doing a random rollout to the end, and backpropagating the result. It does not require a learned value function (though it can use one, as in AlphaZero). In game AI (chess, Go, tic-tac-toe), MCTS is used for planning and action selection; it balances exploration (trying undervisited moves) and exploitation (favoring good moves). ...
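The four phases can be sketched compactly for tic-tac-toe. The board encoding (a 9-tuple of `'X'`/`'O'`/`None`), reward scheme (1 win, 0.5 draw, 0 loss), and exploration constant are illustrative assumptions:

```python
import math
import random

def legal_moves(board):
    return [i for i, c in enumerate(board) if c is None]

def winner(board):
    lines = [(0,1,2),(3,4,5),(6,7,8),(0,3,6),(1,4,7),(2,5,8),(0,4,8),(2,4,6)]
    for a, b, c in lines:
        if board[a] is not None and board[a] == board[b] == board[c]:
            return board[a]
    return None

def play(board, move, player):
    return board[:move] + (player,) + board[move + 1:]

class Node:
    def __init__(self, board, player, parent=None):
        self.board, self.player = board, player   # player is the side to move here
        self.parent, self.children = parent, {}   # children: move -> Node
        self.n, self.w = 0, 0.0                   # visit count, total reward

def uct_select(node, c=1.4):
    # UCT: maximise Q/N + c * sqrt(log N(parent) / N(child))
    return max(node.children.values(),
               key=lambda ch: ch.w / ch.n + c * math.sqrt(math.log(node.n) / ch.n))

def rollout(board, player, rng):
    # random playout to a terminal position
    while winner(board) is None and legal_moves(board):
        board = play(board, rng.choice(legal_moves(board)), player)
        player = 'O' if player == 'X' else 'X'
    return winner(board)  # 'X', 'O', or None for a draw

def mcts(board, player, iters=400, rng=None):
    rng = rng or random.Random(0)
    root = Node(board, player)
    for _ in range(iters):
        node = root
        # 1. selection: descend while the node is fully expanded and non-terminal
        while node.children and len(node.children) == len(legal_moves(node.board)):
            node = uct_select(node)
        # 2. expansion: add one untried child, unless the position is terminal
        if winner(node.board) is None and legal_moves(node.board):
            move = rng.choice([m for m in legal_moves(node.board) if m not in node.children])
            nxt = 'O' if node.player == 'X' else 'X'
            node.children[move] = Node(play(node.board, move, node.player), nxt, node)
            node = node.children[move]
        # 3. simulation: random rollout from the new leaf
        result = rollout(node.board, node.player, rng)
        # 4. backpropagation: credit each node from the viewpoint of the
        #    player whose move led into it
        while node is not None:
            node.n += 1
            mover = 'O' if node.player == 'X' else 'X'
            node.w += 1.0 if result == mover else (0.5 if result is None else 0.0)
            node = node.parent
    # final action choice: the most-visited root child (a common convention)
    return max(root.children.items(), key=lambda kv: kv[1].n)[0]
```

To estimate win rate against a random opponent, call `mcts` for one side and `rng.choice(legal_moves(board))` for the other over many games; more iterations per move trades computation for playing strength.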

March 10, 2026 · 3 min · 444 words · codefrydev