When reward distributions change over time—exponential recency-weighted average and constant step size.