Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Open Language Data Initiative

community
https://oldi.org/
openlanguagedata
Activity Feed

AI & ML interests

Multilingual NLP, underserved languages

Recent Activity

cointegrated  new activity 10 days ago
openlanguagedata/flores_plus:Information on the Lombard data
cointegrated  updated a dataset about 1 month ago
openlanguagedata/flores_plus
cointegrated  new activity about 1 month ago
openlanguagedata/flores_plus:Add Mauritian Creole
View all activity

David Dale's profile picture Laurie Burchell's profile picture Isaac Caswell's profile picture Jean's profile picture Skyler Wang's profile picture

openlanguagedata 's collections 1

OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated Nov 23, 2025 • 887k • 9.8k • 99
  • openlanguagedata/oldi_seed

    Viewer • Updated Nov 6, 2025 • 564k • 1.14k • 10
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 3.79k • 80
  • google/wmt24pp

    Viewer • Updated Mar 13, 2025 • 54.9k • 4.43k • 74
OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated Nov 23, 2025 • 887k • 9.8k • 99
  • openlanguagedata/oldi_seed

    Viewer • Updated Nov 6, 2025 • 564k • 1.14k • 10
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 3.79k • 80
  • google/wmt24pp

    Viewer • Updated Mar 13, 2025 • 54.9k • 4.43k • 74
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs