VIDEOP2R: Video Understanding from Perception to Reasoning Paper β’ 2511.11113 β’ Published Nov 14, 2025 β’ 112
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper β’ 2509.22611 β’ Published Sep 26, 2025 β’ 120
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper β’ 2504.13837 β’ Published Apr 18, 2025 β’ 141
Running 124 Berkeley Function Calling Leaderboard π 124 View the Berkeley Function-Calling Leaderboard