Running on CPU Upgrade Agents 604 GAIA Leaderboard ๐ฆพ 604 Submit and score your model on the GAIA benchmark
Running 124 Berkeley Function Calling Leaderboard ๐ 124 View the Berkeley Function-Calling Leaderboard
Running Agents 230 BigCodeBench Leaderboard ๐ฅ 230 Explore code-generation model leaderboards and task details