Training

MSE for regression, cross-entropy for classification, and the TD error loss in DQN: how loss functions guide neural network training.
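
As a rough preview of the three losses named above, here is a minimal sketch in plain Python. The function names and the `gamma` and `done` parameters are illustrative assumptions, not taken from the lesson, and the TD loss is written as a plain squared error; practical DQN implementations often use a Huber loss instead.

```python
import math

def mse(preds, targets):
    """Mean squared error over paired predictions and targets (regression)."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)

def cross_entropy(probs, label):
    """Negative log-probability the model assigns to the correct class index."""
    return -math.log(probs[label])

def td_error_loss(q_sa, reward, next_q_values, gamma=0.99, done=False):
    """Squared TD error: (r + gamma * max_a' Q(s', a') - Q(s, a)) ** 2.

    Assumes a one-step target; on terminal transitions the target is just r.
    """
    target = reward if done else reward + gamma * max(next_q_values)
    return (target - q_sa) ** 2
```

In each case a smaller value means the prediction is closer to its target, which is what lets gradient descent use the loss as a training signal.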

The chain rule applied backwards through a neural network: computing gradients for every weight and verifying them with numerical finite differences.
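
The gradient check described above can be sketched on the smallest possible case: a single sigmoid neuron with a squared-error loss. The network, the function names, and the step size `eps` are assumptions chosen for illustration; a real network repeats the same chain-rule step layer by layer.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss(w, b, x, t):
    """Squared error of a single sigmoid neuron: (sigmoid(w*x + b) - t) ** 2."""
    y = sigmoid(w * x + b)
    return (y - t) ** 2

def backprop_grads(w, b, x, t):
    """Analytic gradients via the chain rule, working from the loss backwards."""
    y = sigmoid(w * x + b)
    dL_dy = 2.0 * (y - t)      # derivative of the loss w.r.t. the output
    dy_dz = y * (1.0 - y)      # derivative of the sigmoid w.r.t. its input
    dL_dz = dL_dy * dy_dz
    return dL_dz * x, dL_dz    # dz/dw = x, dz/db = 1

def numerical_grads(w, b, x, t, eps=1e-5):
    """Central finite differences: perturb one parameter at a time."""
    dw = (loss(w + eps, b, x, t) - loss(w - eps, b, x, t)) / (2 * eps)
    db = (loss(w, b + eps, x, t) - loss(w, b - eps, x, t)) / (2 * eps)
    return dw, db

analytic = backprop_grads(0.5, -0.2, 1.5, 1.0)
numeric = numerical_grads(0.5, -0.2, 1.5, 1.0)
# The two pairs should agree to many decimal places if backprop is correct.
max_abs_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
```

A large `max_abs_diff` usually points to a mistake in the analytic derivative rather than in the finite-difference estimate.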