MARO: Learning Stronger Reasoning from Social Interaction
arXiv:2601.12323v2 Announce Type: replace Abstract: Humans face countless scenarios that require reasoning and judgment in daily life. However, existing large language model training methods primarily...