SLED Generating Benchmarks for Factuality Evaluation of Language Models Paper • 2307.06908 • Published Jul 13, 2023 • 8
Generating Benchmarks for Factuality Evaluation of Language Models Paper • 2307.06908 • Published Jul 13, 2023 • 8
text generation microsoft/biogpt Text Generation • Updated Feb 3, 2023 • 233k • 300 microsoft/MediPhi Text Generation • 4B • Updated Dec 15, 2025 • 1.11k • 18 google/medgemma-4b-it Image-Text-to-Text • Updated Oct 28, 2025 • 173k • 911 mistralai/Mistral-7B-v0.3 7B • Updated Jul 24, 2025 • 805k • 566
SLED Generating Benchmarks for Factuality Evaluation of Language Models Paper • 2307.06908 • Published Jul 13, 2023 • 8
Generating Benchmarks for Factuality Evaluation of Language Models Paper • 2307.06908 • Published Jul 13, 2023 • 8
text generation microsoft/biogpt Text Generation • Updated Feb 3, 2023 • 233k • 300 microsoft/MediPhi Text Generation • 4B • Updated Dec 15, 2025 • 1.11k • 18 google/medgemma-4b-it Image-Text-to-Text • Updated Oct 28, 2025 • 173k • 911 mistralai/Mistral-7B-v0.3 7B • Updated Jul 24, 2025 • 805k • 566