LACONIC: Length-Aware Constrained Reinforcement Learning for LLM
arXiv:2602.14468v1
Abstract: Reinforcement learning (RL) has enhanced the capabilities of large language models (LLMs) through reward-driven training. Nevertheless, this process can introduce...