rubricreward/PolyGuardMix-tgt_prompt_en_thinking
Viewer
• Updated • 2.92M • 10
rubricreward/PolyGuardMix-en_prompt_en_thinking
Viewer
• Updated • 2.92M • 21
rubricreward/arena-human-preference-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 61.1k • 3
rubricreward/arena-human-preference-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 60.7k • 4
rubricreward/arena-human-preference-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 60.3k • 12
rubricreward/arena-human-preference-tgt_prompt_tgt_thinking
Viewer
• Updated • 120k • 3
rubricreward/arena-human-preference-tgt_prompt_en_thinking
Viewer
• Updated • 120k • 6
rubricreward/arena-human-preference-en_prompt_en_thinking
Viewer
• Updated • 120k • 24
rubricreward/PolyGuardMix
Viewer
• Updated • 2.93M • 3
rubricreward/mR3-Dataset-Filtered1
Viewer
• Updated • 696k • 2
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 624k • 8
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 631k • 3
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 638k • 3
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking
Viewer
• Updated • 890k • 3
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking
Viewer
• Updated • 904k • 6
• 1
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking
Viewer
• Updated • 903k • 7
Viewer
• Updated • 40.5k • 3
rubricreward/HelpSteer3-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 12.7k • 6
rubricreward/HumanEval-XL-Python-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 3.1k • 3
rubricreward/MATH-500-Multilingual-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 4.57k • 10
rubricreward/MMMLU-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 148k • 5
rubricreward/HumanEval-XL-Python-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 3.2k • 8
rubricreward/MATH-500-Multilingual-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 4.83k • 3
rubricreward/MMMLU-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 158k • 3
rubricreward/HumanEval-XL-Python-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 3.09k • 3
rubricreward/MATH-500-Multilingual-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 4.59k • 5
rubricreward/MMMLU-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 148k • 3
rubricreward/arena-human-preference-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 51.4k • 3
rubricreward/HumanEval-XL-Python-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 3.25k • 6
rubricreward/MATH-500-Multilingual-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 4.83k • 6