Evaluating Chain-of-Thought Reasoning through Reusability and Verifiability
arXiv:2602.17544v1
Abstract: In multi-agent information retrieval (IR) pipelines for tasks such as search and ranking, LLM-based agents exchange intermediate reasoning in the form of Chain-of-Thought...