Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation
arXiv:2601.21464v1
Announce Type: new
Abstract: Training large language models (LLMs) for non-verifiable tasks, such as creative writing, dialogue, and ethical reasoning, remains challenging due to...