Learning Sequential Decisions from Multiple Sources via Group-Robust Markov Decision Processes
arXiv:2602.01825v1
Abstract: We often collect data from multiple sites (e.g., hospitals) that share common structure but also exhibit heterogeneity. This paper aims...