stephen-flood's Collections: Benchmarks
• 2.09k • 130 • 4
• 5.82M • 18.6k • 43
• 231k • 361k • 681
Benchmark • 17.6k • 535k • 1.19k
• 19.6k • 7
lighteval/legal_summarization • 26.9k • 254 • 25
• 1.6k • 179 • 2
lighteval/synthetic_reasoning • 33k • 105 • 8
lighteval/synthetic_reasoning_natural • 22k • 59 • 15
• 90.3k • 77 • 3
lighteval/GPT3_unscramble • 50k • 19 • 1
lighteval/aimo_progress_prize_1 • 10 • 19
• 1.7k • 9
• 72.5k • 3.34k • 143
• 860k • 13.4k • 540
Text Classification • 56.4k • 82
Jofthomas/hermes-function-calling-thinking-V1 • 3.57k • 471 • 74
NousResearch/hermes-function-calling-v1 • 11.6k • 4.77k • 382
• 15.7k • 59 • 6
• 621M • 10.9k • 87
open-web-math/open-web-math • 6.32M • 12.5k • 329