Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs
arXiv:2602.00513v2

Abstract: Cyber threat intelligence (CTI) analysts routinely convert noisy, unstructured security artifacts into standardized, automation-ready representations. Although large language models (LLMs)...