Training
Overall Progress
0%
MSE for regression, cross-entropy for classification, and the TD error loss in DQN โ how loss functions guide neural network training.
The chain rule applied backwards through a neural network โ computing gradients for every weight and verifying them with numerical finite differences.