Running on CPU Upgrade Agents 611 GAIA Leaderboard 🦾 611 Submit and view GAIA model evaluation leaderboard
Running 125 Berkeley Function Calling Leaderboard 🏃 125 View the Berkeley Function-Calling Leaderboard
Running Agents 231 BigCodeBench Leaderboard 🥇 231 Explore code-generation model leaderboards and task details