Open Language Data Initiative

community
https://oldi.org/
openlanguagedata

AI & ML interests

Multilingual NLP, underserved languages

Recent Activity

cointegrated  updated a dataset 8 days ago
openlanguagedata/flores_plus
cointegrated  new activity 8 days ago
openlanguagedata/flores_plus:Add Khakas data (kjh_Cyrl)
jeanma  new activity 10 days ago
openlanguagedata/oldi_seed:JSONL conversion

Members: Laurie Burchell, Jean, Skyler Wang, David Dale, Isaac Caswell

openlanguagedata's collections (1)

OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated 8 days ago • 893k • 25.1k • 116
  • openlanguagedata/oldi_seed

    Viewer • Updated 10 days ago • 564k • 1.48k • 10
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 1.68k • 91
  • google/wmt24pp

    Viewer • Updated Jan 22 • 54.9k • 5.48k • 84