Partial translations of the FLORES(+) dataset and translations into non-textual modalities (speech, ASL).
AI & ML interests
Multilingual NLP, underserved languages
Recent Activity
Organization Card
Open Language Data Initiative
Welcome!
The Open Language Data Initiative (OLDI) empowers language communities around the globe to contribute to a database that drives the foundation of today’s machine translation and natural language processing work. We invite community, academic, and industry members to contribute to key datasets that are imperative to the organic expansion of language technology’s reach.
For more information, visit oldi.org.
models 0
None public yet