Agent tuning zai-org/SWE-Dev-train Viewer • Updated Jul 9, 2025 • 20.1k • 1.42k • 21 SWE-Gym/OpenHands-SFT-Trajectories Viewer • Updated May 10, 2025 • 491 • 371 • 15 lmarena-ai/webdev-arena-preference-10k Viewer • Updated Mar 10, 2025 • 10.5k • 367 • 17 SWE-bench/SWE-smith-trajectories Viewer • Updated Jul 19, 2025 • 76k • 4.19k • 62
Agent Benchmarks xw27/scibench Viewer • Updated May 6, 2024 • 692 • 2.12k • 25 google/frames-benchmark Viewer • Updated Oct 15, 2024 • 824 • 15.1k • 261 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 42.2k • 694 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 141k • 316
Agent tuning zai-org/SWE-Dev-train Viewer • Updated Jul 9, 2025 • 20.1k • 1.42k • 21 SWE-Gym/OpenHands-SFT-Trajectories Viewer • Updated May 10, 2025 • 491 • 371 • 15 lmarena-ai/webdev-arena-preference-10k Viewer • Updated Mar 10, 2025 • 10.5k • 367 • 17 SWE-bench/SWE-smith-trajectories Viewer • Updated Jul 19, 2025 • 76k • 4.19k • 62
Agent Benchmarks xw27/scibench Viewer • Updated May 6, 2024 • 692 • 2.12k • 25 google/frames-benchmark Viewer • Updated Oct 15, 2024 • 824 • 15.1k • 261 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 42.2k • 694 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 141k • 316
Running Trackio Loss Demo Static 54ea23 🎯 Visualize your metrics instantly with the Trackio dashboard
akseljoonas/biotech-sentiment-test-A2-deberta-large-ce Text Classification • 0.4B • Updated Mar 17 • 2
akseljoonas/biotech-sentiment-A-deberta-large-focal-clean Text Classification • 0.4B • Updated Mar 17 • 5