Chapter 54: Monte Carlo Tree Search (MCTS)

Learning objectives

- Implement MCTS for a small game (e.g. tic-tac-toe): selection (UCT), expansion, simulation (rollout), backpropagation.
- Use UCT (Upper Confidence bounds applied to Trees) for node selection: \(\frac{Q(s,a)}{N(s,a)} + c \sqrt{\frac{\log N(s)}{N(s,a)}}\).
- Evaluate win rate against a random opponent.

Concept and real-world RL

MCTS builds a search tree by repeatedly selecting a leaf (via UCT), expanding it, running a random rollout to the end of the game, and backpropagating the result. It does not require a learned value function (though it can use one, as in AlphaZero). In game AI (chess, Go, tic-tac-toe), MCTS is used for planning and action selection; it balances exploration (trying under-visited moves) and exploitation (favoring moves with good average outcomes). ...
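The four phases above can be sketched in a minimal tic-tac-toe MCTS. This is an illustrative sketch, not the chapter's implementation; all names (`Node`, `uct_select`, `mcts`, etc.) are assumptions.

```python
import math, random

WIN_LINES = [(0,1,2),(3,4,5),(6,7,8),(0,3,6),(1,4,7),(2,5,8),(0,4,8),(2,4,6)]

def winner(board):
    for a, b, c in WIN_LINES:
        if board[a] and board[a] == board[b] == board[c]:
            return board[a]
    return None

def moves(board):
    return [i for i, v in enumerate(board) if v is None]

class Node:
    def __init__(self, board, player, parent=None):
        self.board, self.player = board, player   # player to move at this node
        self.parent, self.children = parent, {}
        self.N, self.Q = 0, 0.0                   # visits, total reward (parent's view)

def uct_select(node, c=1.4):
    # UCT: Q/N + c * sqrt(log(N_parent) / N_child)
    return max(node.children.values(),
               key=lambda ch: ch.Q / ch.N + c * math.sqrt(math.log(node.N) / ch.N))

def rollout(board, player):
    # Play uniformly random moves until the game ends; return winner or None (draw).
    while True:
        w = winner(board)
        if w:
            return w
        ms = moves(board)
        if not ms:
            return None
        m = random.choice(ms)
        board = board[:m] + (player,) + board[m+1:]
        player = 'O' if player == 'X' else 'X'

def mcts(root, iters=2000):
    for _ in range(iters):
        node = root
        # 1. Selection: descend while the node is fully expanded.
        while node.children and len(node.children) == len(moves(node.board)):
            node = uct_select(node)
        # 2. Expansion: add one untried child if the node is non-terminal.
        if winner(node.board) is None and moves(node.board):
            untried = [m for m in moves(node.board) if m not in node.children]
            m = random.choice(untried)
            b = node.board[:m] + (node.player,) + node.board[m+1:]
            child = Node(b, 'O' if node.player == 'X' else 'X', parent=node)
            node.children[m] = child
            node = child
        # 3. Simulation: random rollout from the new node.
        w = rollout(node.board, node.player)
        # 4. Backpropagation: credit each node from the mover's perspective.
        while node:
            node.N += 1
            mover = 'O' if node.player == 'X' else 'X'
            node.Q += 1.0 if w == mover else 0.5 if w is None else 0.0
            node = node.parent
    # Robust-child rule: pick the most-visited move at the root.
    return max(root.children.items(), key=lambda kv: kv[1].N)[0]

random.seed(0)  # for reproducibility of this sketch
root = Node((None,) * 9, 'X')
best = mcts(root, 2000)  # with enough iterations, X's opening is usually the center
```

The final move choice uses the robust-child rule (most visits) rather than the UCT score, which is the common convention since visit counts are less noisy than mean values.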

March 10, 2026 · 3 min · 444 words · codefrydev

Chapter 55: AlphaZero Architecture

Learning objectives

- Implement a simplified AlphaZero for tic-tac-toe: a neural network that outputs a policy (move probabilities) and a value (expected outcome).
- Use the network inside MCTS: the policy provides priors at expansion, and the value evaluates leaves (replacing the random rollout).
- Train via self-play: generate games, train the network on (state, policy target, value target) tuples, repeat.

Concept and real-world RL

AlphaZero combines MCTS with a neural network: the network provides a prior over moves and a value for leaf states, so MCTS does not need random rollouts. Training is self-play: the current network plays against itself, and the MCTS visit distribution and the game outcome become the policy and value targets for the network. In game AI (chess, Go, shogi), AlphaZero achieves superhuman play. The same idea (planning with a learned model and value) appears in robot planning and dialogue systems. ...
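The way the network's policy enters node selection can be sketched with the PUCT rule that AlphaZero uses in place of plain UCT: the exploration term is scaled by the prior P(a|s). A minimal sketch, assuming illustrative names (`puct_score`, `select_action`) and hand-made priors in place of a real network:

```python
import math

def puct_score(q, n_child, prior, n_parent, c_puct=1.5):
    """PUCT: mean value Q plus an exploration bonus scaled by the network prior."""
    return q + c_puct * prior * math.sqrt(n_parent) / (1 + n_child)

def select_action(stats, priors, n_parent, c_puct=1.5):
    # stats: {action: (total_value, visit_count)}; priors: {action: P(a|s)}
    def score(a):
        w, n = stats.get(a, (0.0, 0))
        q = w / n if n else 0.0
        return puct_score(q, n, priors[a], n_parent, c_puct)
    return max(priors, key=score)

# Early in search, an unvisited move with a high prior beats a visited one:
priors = {0: 0.7, 1: 0.2, 2: 0.1}   # hypothetical network output
stats = {1: (0.9, 1)}               # action 1 visited once, value 0.9
a = select_action(stats, priors, n_parent=2)  # → 0 (high-prior unvisited move)
```

During self-play, the root visit counts (normalized) become the policy target and the final game outcome becomes the value target for every state in the game, which is what the training loop in the objectives list consumes.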

March 10, 2026 · 3 min · 460 words · codefrydev