Multilingual MT-Bench
LGAI-EXAONE/KoMT-Bench • Updated Aug 8, 2024 • 80 • 145 • 40
StudentLLM/Korean_MT-Bench_questions • Updated Jul 10, 2023 • 80 • 13 • 1
naive-puzzle/japanese-mt-bench • Updated Aug 2, 2024 • 380 • 26 • 1
karakuri-ai/corrected-mt-bench-ja • Updated Jul 11, 2024 • 80 • 78 • 1

Foundation Corpus
HuggingFaceFW/fineweb-edu • Updated Jul 11, 2025 • 3.5B • 340k • 1.02k
mlfoundations/dclm-baseline-1.0 • Updated Jul 22, 2024 • 154k • 262
Zyphra/Zyda-2 • Updated Aug 6, 2025 • 184k • 90
LLM360/TxT360 • Updated May 26, 2025 • 43k • 248

Benchmark
openai/MMMLU • Updated Oct 16, 2024 • 393k • 7.25k • 519
google/IFEval • Updated Aug 14, 2024 • 541 • 84.8k • 144

Reward Datasets
nvidia/HelpSteer2 • Updated Dec 18, 2024 • 21.4k • 3.64k • 443
Skywork/Skywork-Reward-Preference-80K-v0.1 • Updated Oct 25, 2024 • 82k • 121 • 47

Math
MathGenie/MathCode-Pile • Updated Oct 16, 2024 • 719k • 260 • 25
TIGER-Lab/MathInstruct • Updated May 15, 2024 • 262k • 11.4k • 301

benchmark-target
tasksource/tasksource_dpo_pairs • Updated Jul 1, 2024 • 5.13M • 103 • 21
euclaise/SuperMC • Updated Jan 25, 2024 • 278k • 18 • 2
argilla/ifeval-like-data • Updated Oct 17, 2024 • 606k • 690 • 47

Benchmark-Korean
skt/kobest_v1 • Updated Mar 28, 2024 • 23.4k • 2.83k • 54
HAERAE-HUB/KMMLU • Updated Mar 5, 2024 • 244k • 6.44k • 97
HAERAE-HUB/KMMLU-HARD • Updated Mar 9, 2024 • 4.33k • 800 • 13
maywell/LogicKor • Updated Jun 9, 2024 • 65 • 21

Korean Datasets [Good]
HAERAE-HUB/Korean-Human-Judgements • Updated Jun 30, 2024 • 694 • 35 • 38