Chapter 99: Debugging RL Code
Learning objectives
- Take a broken RL implementation (e.g. a SAC agent that does not learn, or converges to a poor return) and diagnose the issue systematically.
- Write unit tests for the environment (step returns correct shapes, reset works, the reward is bounded), the replay buffer (sample returns the correct batch shape; storage and sampling are consistent), and gradient shapes (the backward pass of the critic loss produces gradients of the right shape).
- Add logging for Q-values (min, max, mean), rewards (per step and per episode), and entropy (or log_prob) so you can spot numerical issues, policy collapse, or scale problems.
- Identify the root cause (e.g. a wrong sign, a wrong target, the learning rate, or the reward scale) and fix it.
- Relate debugging practice to domains such as robot navigation and healthcare, where bugs can be costly.

Concept and real-world RL ...
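The environment and replay-buffer checks above can be sketched as small unit tests. The `ToyEnv` and `ReplayBuffer` below are hypothetical minimal stand-ins, written only to show the shape of the tests; in practice you would run the same assertions against your real environment and buffer.

```python
import numpy as np

# Hypothetical 1-D toy environment; stands in for your real env.
class ToyEnv:
    def __init__(self, horizon=10):
        self.horizon = horizon
        self.t = 0
        self.state = np.zeros(2, dtype=np.float32)

    def reset(self):
        self.t = 0
        self.state = np.zeros(2, dtype=np.float32)
        return self.state.copy()

    def step(self, action):
        self.t += 1
        self.state = self.state + np.asarray(action, dtype=np.float32)
        # reward is bounded in [-10, 0] by construction
        reward = float(np.clip(-np.abs(self.state).sum(), -10.0, 0.0))
        done = self.t >= self.horizon
        return self.state.copy(), reward, done, {}

# Hypothetical ring-buffer replay memory.
class ReplayBuffer:
    def __init__(self, capacity, obs_dim, act_dim):
        self.capacity, self.size, self.ptr = capacity, 0, 0
        self.obs = np.zeros((capacity, obs_dim), dtype=np.float32)
        self.act = np.zeros((capacity, act_dim), dtype=np.float32)
        self.rew = np.zeros(capacity, dtype=np.float32)

    def add(self, obs, act, rew):
        self.obs[self.ptr], self.act[self.ptr], self.rew[self.ptr] = obs, act, rew
        self.ptr = (self.ptr + 1) % self.capacity
        self.size = min(self.size + 1, self.capacity)

    def sample(self, batch_size, rng):
        idx = rng.integers(0, self.size, size=batch_size)
        return self.obs[idx], self.act[idx], self.rew[idx]

def test_env_step_shapes_and_reward_bounds():
    env = ToyEnv()
    obs = env.reset()
    assert obs.shape == (2,)
    next_obs, reward, done, info = env.step(np.array([0.1, -0.1]))
    assert next_obs.shape == obs.shape
    assert -10.0 <= reward <= 0.0          # reward bounded by design
    assert isinstance(done, bool)

def test_reset_restores_initial_state():
    env = ToyEnv()
    env.reset()
    env.step(np.array([1.0, 1.0]))
    obs = env.reset()
    assert np.allclose(obs, 0.0) and env.t == 0

def test_buffer_sample_shapes_and_consistency():
    buf = ReplayBuffer(capacity=8, obs_dim=2, act_dim=2)
    # store transitions where obs == i, act == -i, rew == i
    for i in range(5):
        buf.add(np.full(2, i), np.full(2, -i), float(i))
    obs, act, rew = buf.sample(4, np.random.default_rng(0))
    assert obs.shape == (4, 2) and act.shape == (4, 2) and rew.shape == (4,)
    # each sampled transition must match what was stored together
    for o, a, r in zip(obs, act, rew):
        assert np.allclose(o, r) and np.allclose(a, -r)

test_env_step_shapes_and_reward_bounds()
test_reset_restores_initial_state()
test_buffer_sample_shapes_and_consistency()
print("all env/buffer tests passed")
```

The storage/sampling consistency test is the one that most often catches real bugs: by encoding a known relationship between `obs`, `act`, and `rew` at insertion time, it detects off-by-one pointer errors that scramble which fields belong to which transition.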
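The gradient-shape check can look like the following, assuming PyTorch. The tiny critic here is a hypothetical stand-in for the Q-network in your SAC code; the point is the assertions, which catch two common bugs at once: a silently broadcast loss (e.g. a `(B,)` prediction against a `(B, 1)` target producing a `(B, B)` error matrix) and parameters that never receive gradients because they are detached from the graph.

```python
import torch
import torch.nn as nn

# Hypothetical tiny critic; stands in for the Q-network in your SAC code.
critic = nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 1))

def test_critic_backward_gradient_shapes():
    obs_act = torch.randn(8, 4)     # batch of concatenated (state, action) inputs
    target = torch.randn(8, 1)      # fake TD targets; shape must match q exactly
    q = critic(obs_act)
    assert q.shape == (8, 1), "critic output has unexpected shape"
    loss = nn.functional.mse_loss(q, target)   # shapes match, so no broadcasting
    loss.backward()
    for name, p in critic.named_parameters():
        assert p.grad is not None, f"no gradient reached parameter {name}"
        assert p.grad.shape == p.shape, f"gradient shape mismatch for {name}"

test_critic_backward_gradient_shapes()
print("gradient shape test passed")
```

A useful variant is to run the same test with a deliberately mismatched target shape and confirm that your training code raises or warns rather than silently broadcasting.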
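The logging objective can be sketched as a small diagnostics helper. This is a minimal sketch, not any particular library's API: `Diagnostics`, `log_step`, and `log_episode_end` are illustrative names, and the warning thresholds are placeholder values you would tune for your own reward and Q-value scales.

```python
from collections import deque

import numpy as np

# Minimal sketch of a diagnostics logger; all names and thresholds are
# illustrative, not taken from any particular RL library.
class Diagnostics:
    def __init__(self, window=100):
        self.episode_returns = deque(maxlen=window)
        self._current_return = 0.0

    def log_step(self, step, reward, q_values, log_probs):
        self._current_return += reward
        q = np.asarray(q_values, dtype=np.float64)
        entropy = -float(np.mean(log_probs))   # sample-based entropy estimate
        stats = {
            "step": step,
            "reward": float(reward),
            "q_min": float(q.min()),
            "q_max": float(q.max()),
            "q_mean": float(q.mean()),
            "entropy": entropy,
        }
        # red flags worth checking on every step while debugging:
        assert np.isfinite(q).all(), "NaN/inf in Q-values"
        if abs(stats["q_mean"]) > 1e4:   # placeholder threshold
            print(f"WARNING step {step}: Q-value scale exploding "
                  f"(mean {stats['q_mean']:.1f})")
        if entropy < 1e-3:               # placeholder threshold
            print(f"WARNING step {step}: entropy near zero "
                  f"(possible policy collapse)")
        return stats

    def log_episode_end(self):
        self.episode_returns.append(self._current_return)
        ret, self._current_return = self._current_return, 0.0
        return {"episode_return": ret,
                "mean_return_100": float(np.mean(self.episode_returns))}

# Usage: inside the training loop you would call something like
#   stats = diag.log_step(step, reward, q_values=q_batch, log_probs=logp_batch)
# on every environment step, and diag.log_episode_end() when done is True.
diag = Diagnostics()
print(diag.log_step(0, -0.5, q_values=[0.1, 0.2], log_probs=[-1.0, -1.2]))
```

Tracking min, max, and mean of the Q-values together is what makes scale problems visible: a healthy run has all three drifting slowly in the same range, while a wrong-sign or wrong-target bug typically sends the max diverging long before the mean does.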