Reliable and Efficient Amortized Model-Based Evaluation
Datasets and Models for the REEval project
    stair-lab/reeval Viewer • Updated Jun 21, 2025 • 5.69M • 68 • 1
    stair-lab/reeval-difficulty-for-helm Viewer • Updated Mar 18, 2025 • 217k • 39 • 1

Gathering Context for Decision Support with LLMs
    stair-lab/bosd_initial_dataset Viewer • Updated Jan 7, 2025 • 568 • 6

Dynamics of Learning
Datasets and Models for the CodeInsights Projects
    stair-lab/code_insights_jsons Preview • Updated Dec 11, 2024 • 9
    stair-lab/code_insights_csv Viewer • Updated about 10 hours ago • 3.07M • 22 • 1
    stair-lab/code_insights_matrices Preview • Updated Dec 12, 2024 • 7
    stair-lab/code-insights-llm_simulator Text Generation • 8B • Updated Sep 8, 2024

Nonmyopic Bayesian Optimization in Dynamic Cost Settings
Datasets and Models for the Nonmyopic BO project
    stair-lab/semi_synthetic_protein_2p12_gemma_7b Viewer • Updated Dec 18, 2024 • 12.3k • 8
    stair-lab/proteinea_fluorescence-embedding Viewer • Updated Dec 18, 2024 • 188k • 40

Finetuning and Comprehensive Evaluation of Vietnamese LLM
    stair-lab/MATH_vi Viewer • Updated Sep 1, 2024 • 25k • 8 • 2
    stair-lab/VSMEC Viewer • Updated Sep 1, 2024 • 6.24k • 15
    stair-lab/ViHSD Viewer • Updated Sep 1, 2024 • 30.7k • 16
    stair-lab/VSFC Viewer • Updated Sep 1, 2024 • 14.6k • 6

Cultural Alignment
    akhilayerukola/NormAd Viewer • Updated Oct 25, 2024 • 2.63k • 247 • 4
    ura-hcmut/ECLeKTic Preview • Updated Jun 5, 2025 • 19 • 1
    ToxicityPrompts/PolyGuardPrompts Viewer • Updated Jun 23, 2025 • 29.3k • 381 • 3
    SALT-NLP/CultureBank Viewer • Updated Apr 24, 2024 • 23k • 122 • 17