What this section covers
Machine learning is the engine beneath every modern RL algorithm. Before you can implement DQN, PPO, or any deep RL method, you need to be fluent in supervised learning concepts: loss functions, gradient descent, classification, and model evaluation. This section builds that foundation systematically, from first principles to scikit-learn.
Topics covered:
- What machine learning is and how it differs from traditional programming
- How data is structured as features and labels for ML models
- Linear regression: MSE loss, the gradient, and one gradient step
- Gradient descent: the optimization algorithm that trains every neural network
- Multiple regression: matrix form, NumPy vectorization, multi-feature problems
- Classification concepts: decision boundaries, sigmoid, binary decisions
- Logistic regression: cross-entropy loss, softmax policy connection
- Model evaluation: accuracy, precision, recall, F1, confusion matrices
- Overfitting and underfitting: regularization, train/test splits, bias–variance
- Scikit-learn workflows: pipelines, model selection, cross-validation
- Decision trees and random forests: non-linear models and feature importance
- Neural network basics: layers, activations, forward pass
- Backpropagation: the algorithm that computes gradients in deep networks
- Review and bridge: how every concept here reappears inside RL algorithms
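As a preview of the linear-regression pages above, here is a minimal sketch of one MSE gradient step on a single-feature problem. The data is synthetic and all numbers are hypothetical; the pages themselves derive each line from first principles.

```python
import numpy as np

# Synthetic data: one feature x, targets near y = 2x (hypothetical example).
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=20)
y = 2.0 * x + rng.normal(0, 0.05, size=20)

w = 0.0                               # initial weight (no bias, for simplicity)
pred = w * x
mse = np.mean((pred - y) ** 2)        # MSE loss at the current weight
grad = np.mean(2 * (pred - y) * x)    # dMSE/dw, derived analytically
w = w - 0.1 * grad                    # one gradient step with learning rate 0.1
```

A single step already moves `w` toward the true slope and lowers the loss; page 4 repeats this step in a loop.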
Why ML foundations matter for RL
RL IS ML. Understanding supervised learning first makes every RL algorithm click:
| ML concept | Where it reappears in RL |
|---|---|
| Linear regression | Value function approximation \(V(s) = w^T \phi(s)\) |
| Gradient descent | Policy gradient, Q-learning updates |
| Classification | Policy \(\pi(a \mid s)\): choosing an action from a state |
| Logistic regression | Softmax policy over discrete actions |
| Cross-entropy loss | Policy gradient objective |
| Overfitting | Generalization in deep RL agents |
| Neural networks | Deep Q-Networks (DQN), actor–critic networks |
| Backpropagation | How policy and value networks are trained |
Every page in this section ends with an explicit RL connection so you always know why you are learning it.
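Two rows of the table can be made concrete in a few lines of NumPy. This is an illustrative sketch with made-up numbers: `phi` stands in for a state feature vector and `prefs` for action preferences (logits); neither comes from a real environment.

```python
import numpy as np

# Linear regression reappearing as value-function approximation V(s) = w^T phi(s).
phi = np.array([1.0, 0.5, -0.2])   # hypothetical features of one state s
w = np.array([0.3, -0.1, 0.8])     # learned value-function weights
V = w @ phi                        # estimated value of s

# Logistic regression reappearing as a softmax policy over 3 discrete actions.
prefs = np.array([1.2, 0.4, -0.7])   # hypothetical action preferences (logits)
pi = np.exp(prefs - prefs.max())     # subtract the max for numerical stability
pi /= pi.sum()                       # softmax: pi(a|s), a valid distribution
```

The same dot product and the same softmax appear later inside DQN value heads and policy-gradient networks.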
Table of contents
| # | Page | Topic |
|---|---|---|
| 1 | What is ML? | Three types of ML, supervised vs RL |
| 2 | Datasets and Features | X, y, DataFrames, pandas |
| 3 | Linear Regression | MSE, gradient, one step |
| 4 | Gradient Descent | Learning rate, loss curves |
| 5 | Multiple Regression | Matrix form, NumPy |
| 6 | Classification Concepts | Decision boundary, sigmoid |
| 7 | Logistic Regression | Cross-entropy, gradient update |
| 8 | Model Evaluation | Accuracy, precision, recall, F1 |
| 9 | Overfitting | Regularization, train/test split |
| 10 | Scikit-learn Workflows | Pipelines, cross-validation |
| 11 | Decision Trees | Non-linear models, feature importance |
| 12 | Neural Networks Intro | Layers, activations, forward pass |
| 13 | Backpropagation | Chain rule, gradient flow |
| 14 | Review and Bridge to RL | Connecting everything to RL |
Quick-start guide
- Complete pages in order. Each page builds on the previous one. Do not skip.
- Do every exercise. The pyrepl blocks run in your browser; no setup needed.
- Check the worked solutions only after a genuine attempt. The struggle is where the learning happens.
- Use the extra practice items. Items 5 (Debug) and 3 (Challenge) are especially valuable.
- Revisit the RL connection at the bottom of each page. Ask yourself: “Where have I seen this in RL already?”
Estimated time: 2–4 hours per page for a thorough reading + all exercises. The full section takes approximately 35–50 hours.
Assessment checkpoints
After every four pages, check your understanding:
- After page 4, Checkpoint A (Regression and Optimization): Can you implement gradient descent from scratch in NumPy?
- After page 7, Checkpoint B (Classification): Can you train logistic regression and explain cross-entropy?
- After page 10, Checkpoint C (Evaluation and sklearn): Can you evaluate a model correctly and avoid overfitting?
- After page 14, Checkpoint D (Bridge to RL): Can you name the RL equivalent of each ML concept?
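For the first checkpoint, your answer should look roughly like the sketch below: full-batch gradient descent on MSE for a two-parameter linear model, with the loss tracked each step. The data is synthetic (true line y = 3x + 1 plus noise); the constants are hypothetical, not a prescribed solution.

```python
import numpy as np

# Synthetic regression data around y = 3x + 1 (hypothetical checkpoint exercise).
rng = np.random.default_rng(42)
X = rng.uniform(-1, 1, size=(100, 1))
y = 3.0 * X[:, 0] + 1.0 + rng.normal(0, 0.1, size=100)

w, b = 0.0, 0.0   # initialize slope and intercept
lr = 0.1          # learning rate
losses = []       # loss curve, for plotting and diagnosing divergence
for _ in range(200):
    pred = w * X[:, 0] + b
    err = pred - y
    losses.append(np.mean(err ** 2))       # MSE at the current parameters
    w -= lr * np.mean(2 * err * X[:, 0])   # dMSE/dw
    b -= lr * np.mean(2 * err)             # dMSE/db
```

If your loss curve decreases smoothly and `(w, b)` lands near the true `(3, 1)`, you have passed Checkpoint A.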