Toolkits (DUPLICATE them, never use the public ones) A set of tools to enable finetuning, evaluations, prototyping, agentic workflows etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!! Running 125 AutoTrain Advanced 🚀 125 Create powerful AI models without code Runtime error Agents 40 LLM Merge Adapter 🐢 40 Runtime error Agents Featured 290 mergekit-gui 🔀 290 Merge AI models using a YAML configuration file
Benchmarks Most commonly used leaderboards to check model capabilities Runtime error 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots Running Featured 455 LLM Performance Leaderboard 🐨 455 View the latest LLM performance leaderboard online Running 4.88k Arena Leaderboard 🏆 4.88k View the LMArena model leaderboard Running on CPU Upgrade 7.34k MTEB Leaderboard 🥇 7.34k Embedding Leaderboard
Running Featured 455 LLM Performance Leaderboard 🐨 455 View the latest LLM performance leaderboard online
Toolkits (DUPLICATE them, never use the public ones) A set of tools to enable finetuning, evaluations, prototyping, agentic workflows etc. ATTENTION: ALWAYS DUPLICATE THESE SPACES ON OUR INFRA!!! Running 125 AutoTrain Advanced 🚀 125 Create powerful AI models without code Runtime error Agents 40 LLM Merge Adapter 🐢 40 Runtime error Agents Featured 290 mergekit-gui 🔀 290 Merge AI models using a YAML configuration file
Benchmarks Most commonly used leaderboards to check model capabilities Runtime error 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots Running Featured 455 LLM Performance Leaderboard 🐨 455 View the latest LLM performance leaderboard online Running 4.88k Arena Leaderboard 🏆 4.88k View the LMArena model leaderboard Running on CPU Upgrade 7.34k MTEB Leaderboard 🥇 7.34k Embedding Leaderboard
Running Featured 455 LLM Performance Leaderboard 🐨 455 View the latest LLM performance leaderboard online