Chain Rule

Derivatives, chain rule, and partial derivatives — with RL motivation and practice.

Derivatives, chain rule, sigmoid and softmax — with RL motivation and explained solutions.

The chain rule applied backwards through a neural network — computing gradients for every weight and verifying them with numerical finite differences.
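
As an illustration of the last item, here is a minimal sketch of a gradient check, not taken from the lesson itself. It assumes a hypothetical one-hidden-unit network (sigmoid hidden unit, linear output, squared-error loss). The names `forward`, `backward`, and `numerical_grad`, and all parameter values, are invented for this example. The backward pass applies the chain rule from the loss toward the input, and each analytic gradient is compared with a central finite difference.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(params, x, y):
    """Hypothetical network: h = sigmoid(w1*x + b1), y_hat = w2*h + b2."""
    w1, b1, w2, b2 = params
    z = w1 * x + b1
    h = sigmoid(z)
    y_hat = w2 * h + b2
    loss = 0.5 * (y_hat - y) ** 2
    return loss, (h, y_hat)


def backward(params, x, y):
    """Analytic gradients via the chain rule, working from the loss backwards."""
    w1, b1, w2, b2 = params
    _, (h, y_hat) = forward(params, x, y)
    d_yhat = y_hat - y             # dL/dy_hat
    d_w2 = d_yhat * h              # dL/dw2 = dL/dy_hat * dy_hat/dw2
    d_b2 = d_yhat                  # dy_hat/db2 = 1
    d_h = d_yhat * w2              # propagate into the hidden unit
    d_z = d_h * h * (1.0 - h)      # sigmoid'(z) = h * (1 - h)
    d_w1 = d_z * x                 # dz/dw1 = x
    d_b1 = d_z                     # dz/db1 = 1
    return [d_w1, d_b1, d_w2, d_b2]


def numerical_grad(params, x, y, eps=1e-6):
    """Central finite difference for each parameter in turn."""
    grads = []
    for i in range(len(params)):
        plus = list(params)
        minus = list(params)
        plus[i] += eps
        minus[i] -= eps
        loss_plus = forward(plus, x, y)[0]
        loss_minus = forward(minus, x, y)[0]
        grads.append((loss_plus - loss_minus) / (2 * eps))
    return grads


# Arbitrary illustrative values, order: w1, b1, w2, b2
params = [0.5, -0.3, 1.2, 0.1]
analytic = backward(params, x=0.7, y=1.0)
numeric = numerical_grad(params, x=0.7, y=1.0)
max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
print("largest analytic vs. numeric gap:", max_diff)
```

If the backward pass is correct, the largest gap should be tiny compared with the gradients themselves. A large gap usually points to a missing factor somewhere in the chain.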