Bandits: UCB1
## Learning objectives

- Understand the UCB1 action-selection rule and why it explores uncertain arms.
- Implement UCB1 on the 10-armed testbed and compare with epsilon-greedy.
- Interpret the exploration bonus \(c \sqrt{\ln t / N(a)}\).

## Theory

UCB1 (Upper Confidence Bound) chooses the action that maximizes an upper bound on the expected reward:

\[ a_t = \arg\max_a \left[ Q(a) + c \sqrt{\frac{\ln t}{N(a)}} \right] \]

where:

- \(Q(a)\) is the sample-mean reward for arm \(a\).
- \(N(a)\) is the number of times arm \(a\) has been pulled.
- \(t\) is the total number of pulls so far.
- \(c\) is a constant (e.g. 2) that controls the amount of exploration.

The term \(c \sqrt{\ln t / N(a)}\) is an exploration bonus: arms that have been pulled less often (small \(N(a)\)) receive a larger bonus, so they are tried more. As \(N(a)\) grows, the bonus shrinks. UCB1 therefore explores systematically rather than randomly (unlike epsilon-greedy).

...
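The selection rule above can be sketched in a few lines of NumPy. This is a minimal sketch, not a reference implementation: it assumes the standard Gaussian 10-armed testbed (true arm means drawn from \(N(0,1)\), unit-variance rewards), and it pulls each arm once at the start so that \(N(a) > 0\) before the bonus is computed. The function name `ucb1_bandit` and the default parameters are illustrative choices.

```python
import numpy as np

def ucb1_bandit(n_arms=10, n_steps=1000, c=2.0, seed=0):
    """Run UCB1 on one Gaussian testbed instance; return per-step rewards."""
    rng = np.random.default_rng(seed)
    true_means = rng.normal(0.0, 1.0, n_arms)  # testbed assumption: q*(a) ~ N(0, 1)
    Q = np.zeros(n_arms)                       # sample-mean reward estimate per arm
    N = np.zeros(n_arms)                       # pull count per arm
    rewards = np.zeros(n_steps)
    for t in range(1, n_steps + 1):
        if t <= n_arms:
            a = t - 1                          # pull each arm once so N(a) > 0
        else:
            bonus = c * np.sqrt(np.log(t) / N)
            a = int(np.argmax(Q + bonus))      # UCB1 rule: argmax of Q(a) + bonus
        r = rng.normal(true_means[a], 1.0)     # reward ~ N(q*(a), 1)
        N[a] += 1
        Q[a] += (r - Q[a]) / N[a]              # incremental sample-mean update
        rewards[t - 1] = r
    return rewards, true_means

rewards, true_means = ucb1_bandit()
```

Swapping the `argmax` line for an epsilon-greedy choice (random arm with probability epsilon, greedy otherwise) gives a drop-in baseline for the comparison in the learning objectives.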