Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction
arXiv:2602.01202v1 Announce Type: new
Abstract: The rapid evolution of agentic workflows has shown that LLM-based agents perform strongly on complex reasoning tasks. However, existing...