Construction-Verification: A Benchmark for Applied Mathematics in Lean 4
arXiv:2602.01291v1 Announce Type: new Abstract: Recent advances in large language models have demonstrated impressive capabilities in mathematical formalization. However, existing benchmarks focus on logical verification...