The Reinforcement Learning Path
From absolute beginner to Deep Reinforcement Learning expert. Follow this structured roadmap to master the concepts in the correct order: every step builds on the last.
The foundational language for modern AI and data science. You will learn to write Python programs from scratch, work with essential data structures, and build the coding habits that power every RL implementation.
- Basic Syntax, Variables, and Data Types
- Control Flow (If/Else, Loops)
- Functions and Object-Oriented Programming
- Data Structures (Lists, Dictionaries, Sets)
- 25 Mini-Challenges: Python Confidence Builder
- Debugging and Reading Error Messages
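To give a flavour of what these basics look like together, here is a small sketch combining a class, a loop, and a dictionary (the `ScoreBoard` example is illustrative, not taken from the course material):

```python
# A class with methods, a dictionary, and a for-loop in one small program.
class ScoreBoard:
    """Track cumulative scores per player using a dictionary."""

    def __init__(self):
        self.scores = {}

    def add(self, player, points):
        # dict.get with a default avoids a KeyError for unseen players
        self.scores[player] = self.scores.get(player, 0) + points

    def leader(self):
        # pick the (player, score) pair with the highest score
        return max(self.scores.items(), key=lambda item: item[1])[0]

board = ScoreBoard()
for player, pts in [("ana", 3), ("bo", 5), ("ana", 4)]:
    board.add(player, pts)

print(board.leader())  # ana has 7 points, bo has 5 -> prints "ana"
```

Reading the error message when a snippet like this breaks (say, a `KeyError` after removing the `.get` default) is exactly the debugging habit the last bullet refers to.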
Master the scientific Python stack that underpins all RL implementations: array operations with NumPy, data manipulation with Pandas, and learning-curve visualisations with Matplotlib.
- NumPy Arrays, Broadcasting, Indexing
- Vectorised Operations and Random Sampling
- Pandas DataFrames and CSV I/O
- Matplotlib: Line Plots, Subplots, Heatmaps
- Smoothing Learning Curves and Error Bars
- Mini-Project: 3-Armed Bandit with NumPy
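One possible shape for the bandit mini-project, shown here as a hedged sketch (the arm payouts, noise level, and epsilon value are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
true_means = np.array([0.2, 0.5, 0.8])  # hidden payout of each arm (assumed setup)
q = np.zeros(3)       # running estimate of each arm's value
counts = np.zeros(3)  # pulls per arm
eps = 0.1

for _ in range(2000):
    # epsilon-greedy: explore a random arm with probability eps,
    # otherwise exploit the current best estimate
    a = rng.integers(3) if rng.random() < eps else int(np.argmax(q))
    reward = rng.normal(true_means[a], 0.1)
    counts[a] += 1
    # incremental mean update: Q <- Q + (r - Q) / n
    q[a] += (reward - q[a]) / counts[a]

print(int(np.argmax(q)))  # the agent should settle on arm 2, the best arm
```

Plotting `q` over time with Matplotlib (and smoothing the curve) ties the visualisation bullets above directly into the project.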
The core mathematical concepts that explain what RL algorithms are actually doing. Every topic is taught with RL motivation, so every formula has a purpose you can see in practice.
- Linear Algebra: Vectors, Matrices, Dot Products
- Calculus: Derivatives, Chain Rule, Partial Derivatives
- Probability: Random Variables, Expectation, Variance
- Statistics: Mean, Std Dev, Standard Error
- Distributions (Normal, Bernoulli) and Bayes' Theorem
- 15 Practice Problems per Topic (45 total)
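As a taste of how these formulas appear in code, here is a small sketch (the parameter p = 0.3 and the sample size are illustrative) checking the Bernoulli formulas E[X] = p and Var[X] = p(1 - p) against Monte Carlo samples:

```python
import numpy as np

rng = np.random.default_rng(1)
p = 0.3  # Bernoulli success probability (chosen for illustration)

# Analytic values: E[X] = p, Var[X] = p(1 - p)
expected_mean, expected_var = p, p * (1 - p)

# Draw 100,000 Bernoulli samples and compare
samples = rng.random(100_000) < p
sample_mean = samples.mean()
sample_var = samples.var()

# Standard error of the mean: s / sqrt(n)
std_err = samples.std(ddof=1) / np.sqrt(samples.size)

print(sample_mean, sample_var, std_err)
```

The sample mean lands within a few standard errors of p, which is the same logic used later to judge whether one RL agent's average return is genuinely better than another's.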
Understanding how machines learn from data is the bridge to RL. You will build every algorithm from scratch in NumPy first, then see how scikit-learn wraps it, so nothing is a black box.
- Supervised Learning: Regression vs. Classification
- Loss Functions and Gradient Descent from Scratch
- Logistic Regression and Cross-Entropy
- Model Evaluation: Train/Test Split, Precision, Recall
- K-Fold Cross-Validation, KNN, Decision Trees
- Practical Implementation with Scikit-Learn
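A minimal sketch of "gradient descent from scratch" in the NumPy-first spirit described above, assuming a toy linear-regression setup (the synthetic data y = 2x + 1, learning rate, and step count are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
# Synthetic data: y = 2x + 1 plus a little noise
x = rng.uniform(-1, 1, 200)
y = 2 * x + 1 + rng.normal(0, 0.05, 200)

w, b, lr = 0.0, 0.0, 0.5
for _ in range(300):
    pred = w * x + b
    # gradients of the mean squared error L = mean((pred - y)^2)
    grad_w = 2 * np.mean((pred - y) * x)
    grad_b = 2 * np.mean(pred - y)
    # descend the loss surface
    w -= lr * grad_w
    b -= lr * grad_b

print(round(w, 2), round(b, 2))  # should recover roughly w = 2, b = 1
```

Swapping the squared-error loss for cross-entropy and the linear output for a sigmoid turns this same loop into the logistic-regression exercise.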
The architecture that powers modern RL. You will implement forward propagation and backpropagation from scratch in NumPy, understanding every gradient, then connect to PyTorch for practical RL networks.
- Perceptrons and Multi-Layer Perceptrons (MLP)
- Activation Functions: ReLU, Sigmoid, Tanh, Softmax
- Forward Propagation (NumPy, step-by-step)
- Backpropagation and Chain Rule (NumPy)
- SGD, Momentum, and Adam Optimisers
- CNNs, PyTorch nn.Module, Mini-Project: MNIST
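The forward/backward pass can be sketched end to end on a tiny problem. This example trains a two-layer MLP on XOR with hand-derived gradients; the layer sizes, learning rate, and choice of XOR are illustrative, not the course's exact exercise:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR targets

W1 = rng.normal(0, 1, (2, 8)); b1 = np.zeros(8)
W2 = rng.normal(0, 1, (8, 1)); b2 = np.zeros(1)

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

lr = 1.0
for _ in range(5000):
    # forward propagation: tanh hidden layer, sigmoid output
    h = np.tanh(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # backpropagation via the chain rule (loss = binary cross-entropy)
    d_out = out - y                      # dL/dz for sigmoid + cross-entropy
    dW2 = h.T @ d_out; db2 = d_out.sum(0)
    d_h = (d_out @ W2.T) * (1 - h ** 2)  # tanh'(z) = 1 - tanh(z)^2
    dW1 = X.T @ d_h; db1 = d_h.sum(0)
    for p, g in ((W1, dW1), (b1, db1), (W2, dW2), (b2, db2)):
        p -= lr * g / len(X)             # plain SGD step

preds = (sigmoid(np.tanh(X @ W1 + b1) @ W2 + b2) > 0.5).astype(int)
print(preds.ravel())
```

Once each gradient here is understood, `loss.backward()` in PyTorch stops being magic: it is this same chain rule, automated.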
The core RL framework, where learning by interaction begins. You will understand MDPs, value functions, and Bellman equations, and implement tabular methods that form the foundation for everything that follows.
- Agent-Environment Interface and Rewards
- Markov Decision Processes (MDPs)
- Value Functions V(s) and Q(s,a)
- Bellman Equations and Dynamic Programming
- Monte Carlo Methods and Temporal Difference
- Q-Learning and SARSA (tabular)
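Tabular Q-learning fits in a few lines. This sketch uses a toy 5-state chain environment invented for illustration (reward 1 only at the rightmost state); the update rule itself is the standard one:

```python
import numpy as np

# Toy chain MDP: states 0..4, actions 0 = left, 1 = right,
# reward 1.0 only for reaching the terminal state 4.
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
rng = np.random.default_rng(0)
alpha, gamma, eps = 0.5, 0.9, 0.2

for _ in range(500):              # episodes
    s = 0
    while s != 4:
        # epsilon-greedy behaviour policy
        a = rng.integers(2) if rng.random() < eps else int(np.argmax(Q[s]))
        s_next = max(s - 1, 0) if a == 0 else s + 1
        r = 1.0 if s_next == 4 else 0.0
        # Q-learning: Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
        Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
        s = s_next

greedy = np.argmax(Q, axis=1)[:4]
print(greedy)  # learned greedy policy: move right in every non-terminal state
```

Replacing `np.max(Q[s_next])` with the value of the action actually taken next turns this off-policy update into SARSA, the on-policy counterpart.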
Combine neural networks with RL to solve complex, high-dimensional problems. This phase covers the algorithms that power real-world applications, from Atari games to robotic control.
- Deep Q-Networks (DQN) and Experience Replay
- Target Networks and Double DQN
- Policy Gradient Methods (REINFORCE)
- Actor-Critic Architecture (A2C)
- Proximal Policy Optimisation (PPO)
- Soft Actor-Critic (SAC)
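One building block from this list, experience replay, is simple enough to sketch on its own. This is a minimal illustration of the idea behind DQN's replay buffer, not a full implementation (class and method names are this sketch's own):

```python
import random
from collections import deque

class ReplayBuffer:
    """Store transitions and sample random minibatches for DQN-style training."""

    def __init__(self, capacity, seed=0):
        self.buffer = deque(maxlen=capacity)  # oldest transitions evicted first
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # uniform sampling breaks the temporal correlation between
        # consecutive steps, stabilising gradient updates
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)

buf = ReplayBuffer(capacity=100)
for t in range(150):  # pushing past capacity silently drops the oldest items
    buf.push(t, 0, 0.0, t + 1, False)

print(len(buf), buf.buffer[0][0])  # 100 transitions kept, oldest is t = 50
```

The target-network idea on the list plays a complementary role: replay decorrelates the data, while a frozen target network decorrelates the regression targets.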
Cutting-edge research areas and specialised applications: from model-based planning and multi-agent coordination to offline RL, imitation learning, and RLHF, the technique behind modern LLMs.
- Model-Based RL and World Models
- Multi-Agent RL (MARL) and QMIX
- Offline RL and Conservative Q-Learning
- Imitation Learning and Inverse RL
- Exploration and Meta-Learning
- RLHF: RL from Human Feedback (used in LLMs)