Invariance-Based Dynamic Regret Minimization
arXiv:2603.03843v1 Announce Type: cross Abstract: We consider stochastic non-stationary linear bandits where the linear parameter connecting contexts to the reward changes over time. Existing algorithms...