Chapter 81: Multi-Agent Fundamentals

Learning objectives

- Model a two-player zero-sum game (e.g. Rock-Paper-Scissors) as a Dec-POMDP (Decentralized Partially Observable MDP) or an equivalent multi-agent framework.
- Define states, observations, actions, and rewards for each agent in the game.
- Explain the difference between centralized (one controller sees everything) and decentralized (each agent has its own observation and policy) formulations.
- Identify how the same game can be viewed both as a normal-form game (payoff matrix) and as a sequential Dec-POMDP (if we add structure).
- Relate multi-agent modeling to game AI (opponents, teammates) and trading (multiple market participants).

Concept and real-world RL ...
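The decentralized view of Rock-Paper-Scissors described above can be sketched as a tiny two-agent environment. This is an illustrative assumption, not code from the chapter: the class name `RPSEnv`, the payoff encoding, and the choice to let each agent observe only the opponent's previous action are all ours.

```python
import numpy as np

# Rock-Paper-Scissors as a two-player zero-sum normal-form game.
# Rows/columns index actions: 0 = rock, 1 = paper, 2 = scissors.
# PAYOFF[a1, a2] is player 1's reward; player 2's reward is its negation.
PAYOFF = np.array([
    [ 0, -1,  1],   # rock   vs rock / paper / scissors
    [ 1,  0, -1],   # paper
    [-1,  1,  0],   # scissors
])

class RPSEnv:
    """Decentralized formulation: each agent submits its own action,
    receives its own reward, and observes only the opponent's
    previous action (its local observation)."""
    def __init__(self):
        self.last_actions = (None, None)

    def step(self, a1, a2):
        r1 = int(PAYOFF[a1, a2])
        self.last_actions = (a1, a2)
        obs1, obs2 = a2, a1  # each agent sees only the other's last move
        return (obs1, obs2), (r1, -r1)

env = RPSEnv()
(obs1, obs2), (r1, r2) = env.step(0, 1)  # rock vs paper
print(r1, r2)  # -1 1 (rock loses to paper)
```

The same `PAYOFF` matrix is also the normal-form view of the game; the sequential `step` interface is what turns it into a (degenerate, one-step) Dec-POMDP.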

March 10, 2026 · 4 min · 673 words · codefrydev

Chapter 82: Game Theory Basics for RL

Learning objectives

- Compute the Nash equilibrium of a simple 2×2 game (e.g. the Prisoner's Dilemma) from its payoff matrix.
- Explain why independent learning (each agent learns its best response without knowing the other's policy) might converge to an outcome that is not a Nash equilibrium, or might not converge at all.
- Compare Nash-equilibrium payoffs with the payoffs that result from independent Q-learning or gradient-based learning in the same game.
- Identify the difference between cooperative, competitive, and mixed settings in terms of payoff structure.
- Relate game theory to game AI (opponent modeling) and trading (market equilibrium).

Concept and real-world RL ...
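Computing the pure-strategy Nash equilibria of a 2×2 game amounts to checking, for every action profile, that each action is a best response to the other. A minimal sketch, assuming the standard Prisoner's Dilemma payoffs (the function name `pure_nash_equilibria` and the specific numbers are our choices):

```python
import numpy as np

# Prisoner's Dilemma payoffs for the row player.
# Actions: 0 = cooperate, 1 = defect.
R1 = np.array([[-1, -3],
               [ 0, -2]])
R2 = R1.T  # symmetric game: column player's payoffs are the transpose

def pure_nash_equilibria(R1, R2):
    """Return all pure action profiles (a1, a2) where neither player
    can gain by unilaterally deviating."""
    eqs = []
    for a1 in range(R1.shape[0]):
        for a2 in range(R1.shape[1]):
            best1 = R1[a1, a2] >= R1[:, a2].max()  # a1 best-responds to a2
            best2 = R2[a1, a2] >= R2[a1, :].max()  # a2 best-responds to a1
            if best1 and best2:
                eqs.append((a1, a2))
    return eqs

print(pure_nash_equilibria(R1, R2))  # [(1, 1)] — mutual defection
```

Note that (defect, defect) yields payoff -2 each, worse than the (cooperate, cooperate) payoff of -1: the equilibrium is not the socially optimal outcome, which is the tension the chapter contrasts with what independent learners actually converge to.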

March 10, 2026 · 4 min · 672 words · codefrydev

Chapter 84: Centralized Training, Decentralized Execution (CTDE)

Learning objectives

- Explain the CTDE paradigm: during training, algorithms can use centralized information (e.g. the global state, all agents' actions) to learn better value functions or gradients; during execution, each agent uses only its local observation and policy (decentralized).
- Give a concrete example (e.g. QMIX, MADDPG, or a simple cooperative task) where the critic or value function uses the global state and the actor uses only its local observation.
- Explain why CTDE helps with non-stationarity: during training, the centralized critic sees the full state and the other agents' actions, so the environment is "stationary" from the critic's perspective (the joint action is known); each agent's policy can then be trained with this stable learning signal.
- Identify why decentralized execution is important for scalability and deployment (no need to communicate all observations at test time).
- Relate CTDE to game AI (team coordination) and robot navigation (multi-robot systems).

Concept and real-world RL ...
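The split between centralized training and decentralized execution shows up directly in the function signatures: the critic conditions on the global state and the joint action, while each actor sees only its local observation. A minimal linear sketch, assuming made-up dimensions and random weights (names like `centralized_value` are illustrative, not from QMIX or MADDPG):

```python
import numpy as np

rng = np.random.default_rng(0)

OBS_DIM, STATE_DIM, N_AGENTS, N_ACTIONS = 4, 8, 2, 3

# One actor per agent (local obs -> action scores), one shared critic.
actor_weights = [rng.normal(size=(OBS_DIM, N_ACTIONS)) for _ in range(N_AGENTS)]
critic_weights = rng.normal(size=(STATE_DIM + N_AGENTS,))  # state + joint action

def act(agent_id, local_obs):
    """Decentralized execution: only the agent's own observation is used."""
    return int(np.argmax(local_obs @ actor_weights[agent_id]))

def centralized_value(global_state, joint_action):
    """Centralized training: the critic sees the full state and every
    agent's action, so its learning target does not drift as the other
    agents' policies change."""
    x = np.concatenate([global_state, np.asarray(joint_action, dtype=float)])
    return float(x @ critic_weights)

obs = [rng.normal(size=OBS_DIM) for _ in range(N_AGENTS)]
state = rng.normal(size=STATE_DIM)
joint = [act(i, obs[i]) for i in range(N_AGENTS)]
q = centralized_value(state, joint)  # used only during training
```

At deployment, only `act` is needed, which is why no inter-agent communication of observations is required at test time.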

March 10, 2026 · 4 min · 754 words · codefrydev

Chapter 90: Communication in MARL

Learning objectives

- Implement a simple communication protocol: each agent outputs a message (e.g. a vector) in addition to its action; the message is fed into the other agents' policies (e.g. as part of their observation at the next step).
- Train agents to solve a task that requires coordination (e.g. two agents must swap positions or colors, or meet at a target) using this communication.
- Compare with the same task without communication (each agent sees only its local observation) and report the improvement in return or success rate.
- Explain how learned communication can encode information (e.g. "I am going left") that helps coordination.
- Relate communication in MARL to dialogue (multi-turn interaction) and robot navigation (multi-robot signaling).

Concept and real-world RL ...

March 10, 2026 · 4 min · 729 words · codefrydev
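The message-passing interface described in the first objective can be sketched as follows. This is a hedged illustration under our own assumptions (two agents, fixed random linear policies, a one-step message delay); training the weights is the part the chapter asks the reader to do:

```python
import numpy as np

rng = np.random.default_rng(0)

# Each agent emits (action, message); at the next step, the other
# agent's message is concatenated onto its local observation.
OBS_DIM, MSG_DIM, N_ACTIONS = 3, 2, 4
IN_DIM = OBS_DIM + MSG_DIM  # local obs + incoming message

W_act = [rng.normal(size=(IN_DIM, N_ACTIONS)) for _ in range(2)]
W_msg = [rng.normal(size=(IN_DIM, MSG_DIM)) for _ in range(2)]

def policy(agent_id, local_obs, incoming_msg):
    """Map (local obs, incoming message) to an action and an outgoing
    message; in a trained system W_act/W_msg would be learned jointly."""
    x = np.concatenate([local_obs, incoming_msg])
    action = int(np.argmax(x @ W_act[agent_id]))
    message = np.tanh(x @ W_msg[agent_id])  # bounded outgoing message
    return action, message

msgs = [np.zeros(MSG_DIM), np.zeros(MSG_DIM)]  # no messages at t = 0
for t in range(3):
    out = [policy(i, rng.normal(size=OBS_DIM), msgs[1 - i]) for i in range(2)]
    actions = [a for a, _ in out]
    msgs = [m for _, m in out]  # delivered to the other agent next step
```

The no-communication baseline in the third objective is the same loop with `MSG_DIM = 0`, which makes the return comparison a clean ablation.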

March 10, 2026 · 4 min · 729 words · codefrydev