Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning
arXiv:2506.21039v2 Announce Type: replace Abstract: Long-horizon goal-conditioned tasks pose fundamental challenges for reinforcement learning (RL), particularly when goals are distant and rewards are sparse. While...