Truthfulness Despite Weak Supervision: Evaluating and Training LLMs Using Peer Prediction
arXiv:2601.20299v1 Announce Type: new Abstract: The evaluation and post-training of large language models (LLMs) rely on supervision, but strong supervision for difficult tasks is often...