QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs Paper • 2602.20629 • Published 8 days ago • 1