Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published 18 days ago • 79
Running on CPU Upgrade 514 Visualize Dataset (v2.0+ latest dataset format) 💻 514 Explore and visualize LeRobot datasets easily