Spaces:
Running
Running
Commit History
Upload from GitHub Actions: Merge pull request #18 from datenlabor-bmz/pr-17 a0d1624 verified
Upload from GitHub Actions: Add auto-translated datasets c790fdb verified
Upload from GitHub Actions: Update evaluation results f88768f verified
Upload from GitHub Actions: Update evaluation results 95c4e14 verified
Upload from GitHub Actions: ran full evaluation locally 088f96f verified
Upload from GitHub Actions: restored model.json d380f79 verified
Upload from GitHub Actions: updated and cleaned up scripts for new eval runs 963cb78 verified
Upload from GitHub Actions: Update models.py, models.json, and results.json with latest evaluation data and model additions 8eebb41 verified
Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev 7c06aef verified
Upload from GitHub Actions: Merge pull request #7 from datenlabor-bmz/jn-dev 6878a71 verified
Upload from GitHub Actions: Get more results, compute average based on all tasks 98c6811 verified
Upload from GitHub Actions: Correlation plot b0aa389 verified
Upload from GitHub Actions: Evaluate Google Translate 338dc9b verified
Upload from GitHub Actions: More models and languages a73f888 verified
Upload from GitHub Actions: Merge remote changes and apply terminology updates: Commercial->closed-source, Open->open-source ebaf279 verified
Upload from nightly evaluation run c3be561 verified
Upload from GitHub Actions: More results 52abc5b verified
Upload from GitHub Actions: Update model ranking fetching f840423 verified
Upload from GitHub Actions: Use FLORES+ via Huggingface 913253a verified
Upload from nightly evaluation run 7fce0be verified
Upload from nightly evaluation run 7e8d13c verified
Upload from nightly evaluation run 9ee89ef verified
Upload from nightly evaluation run 8a4050a verified
Upload from nightly evaluation run 1d4c8a4 verified
Upload from GitHub Actions: New results b311dd5 verified
Upload from nightly evaluation run 47bcf10 verified
Upload from nightly evaluation run dcb356d verified
Block gemini-2.5-pro-exp-03-25 092c06a
David Pomerenke commited on
Only run tasks for which there is no result yet 2f9dee1
David Pomerenke commited on