AI & ML interests
None defined yet.
PolarisEvals/leaderboard-data
Viewer
• Updated
• 1.14M • 12
PolarisEvals/llm_dataset_completness_2stage_score_mini
Viewer
• Updated
• 10 • 5
PolarisEvals/llm_dataset_completness_2stage_score
Viewer
• Updated
• 54.3k • 5
PolarisEvals/llm_dataset_completness_2stage_justification_score
Viewer
• Updated
• 54.3k • 5
PolarisEvals/llm_dataset_completness_2stage
Viewer
• Updated
• 54.3k • 9
PolarisEvals/shikib_dataset_completeness_2stage_unittest
Viewer
• Updated
• 5.47k • 5
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug
Viewer
• Updated
• 100 • 7
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response
Viewer
• Updated
• 5.47k • 6
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest
Viewer
• Updated
• 912 • 7
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug
Viewer
• Updated
• 100 • 6
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts
Viewer
• Updated
• 912 • 5
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug
Viewer
• Updated
• 100 • 8
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions
Viewer
• Updated
• 982 • 5
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug
Viewer
• Updated
• 100 • 6
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug
Viewer
• Updated
• 100 • 6
PolarisEvals/training_criteria_dpo_distill
Viewer
• Updated
• 912 • 6
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug
Viewer
• Updated
• 100 • 7
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug
Viewer
• Updated
• 100 • 6
PolarisEvals/synqa_hudson_300_samples
Viewer
• Updated
• 1.5k • 5
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug
Viewer
• Updated
• 100 • 9
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_True
Viewer
• Updated
• 10 • 6
PolarisEvals/synqa_hudson_300_queries_rubrics_score_completeness_gpt-4-0613_outputs_json_False
Viewer
• Updated
• 10 • 6
PolarisEvals/synqa_hudson_300_queries_rubrics_score
Viewer
• Updated
• 7.5k • 5
PolarisEvals/synqa_hudson_300_samples_gpt-4-0613_outputs
Viewer
• Updated
• 81 • 9