Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 258 • 42 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 21 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 14 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 11 • 1
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 518k • 1.05k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 160k • 264 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 18.9k • 91 LLM360/TxT360 Updated May 26, 2025 • 1.22M • 253
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 11.1k • 522 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 117k • 148
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 4.53k • 454 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 197 • 48
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 543 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 7.87k • 303
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 234 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 57 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 1.12k • 49
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 3.76k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 11.4k • 97 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 1.84k • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 111 • 21
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 131 • 38
Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 258 • 42 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 21 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 14 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 11 • 1
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 543 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 7.87k • 303
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 518k • 1.05k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 160k • 264 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 18.9k • 91 LLM360/TxT360 Updated May 26, 2025 • 1.22M • 253
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 234 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 57 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 1.12k • 49
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 11.1k • 522 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 117k • 148
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 3.76k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 11.4k • 97 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 1.84k • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 111 • 21
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 4.53k • 454 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 197 • 48
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 131 • 38