After Phase 5 you can implement Q-networks and policy networks in PyTorch; Phase 6 adds RL semantics (MDPs, Bellman, tabular methods).
Phase 5
Phase 5 — DL foundations
Neural networks, backpropagation, CNNs, PyTorch patterns, and a mini-project—directly reusable for DQN, policies, and actor-critic.
Module progress