Well-Posed KL-Regularized Control via Wasserstein and Kalman-Wasserstein KL Divergences
arXiv:2602.02250v1 Announce Type: cross Abstract: Kullback-Leibler divergence (KL) regularization is widely used in reinforcement learning, but it becomes infinite under support mismatch and can degenerate...