RL in robotics, safe reinforcement learning, algorithmic trading, recommender systems, training LLMs with PPO, implementing RLHF, Direct Preference Optimization (DPO), evaluating RL agents, debugging RL code, and the future of RL. Chapters 91–100.