Introduction to policy-based methods, the policy objective, REINFORCE, variance reduction, actor-critic, A2C, A3C, continuous action spaces, DDPG, and TD3. Chapters 31–40.