Success Stories
Inside the Human Language Project: A Local Data Collector's Journey
16/10/2023
Interview with Kathleen Kownacki: Bridging Language Divides as a Local Data Collector.
Human Language Project contributes to the digital representation of Igbo
09/08/2023
Participation in the language data projects offered by the Human Language Project leads to new professional opportunities.
Breaking the Publishing Ground: From Dictionaries to Linguistic Data
14/09/2021
Lexicala emerged from the publishing world as a provider of quality lexicographic content for leading dictionary publishers worldwide, and has joined the TAUS Data Marketplace as a language data seller.
From Studying Medicine to Selling MT Training Datasets
14/05/2021
See how a medical doctor who, as a student, set out to break the linguistic monoculture in academia ended up creating a dataset of medical glossaries and translation memories that brings about a 90% BLEU score improvement for the English-Arabic language pair.
To Sell More Data, Effortlessly.
22/04/2021
See how a niche data provider makes use of the promotional opportunity TAUS Data Marketplace creates to reach large data buyers.
Ethical Responsibility towards Underrepresented Languages
05/01/2021
See how a linguist specializing in low-resource Middle Eastern languages mobilizes his native community to generate language data and, by offering it on the Data Marketplace, increases the digital representation of these languages.
A Win-Win Situation
29/12/2020
See how EGO Translating Company became a data seller on TAUS Data Marketplace while looking to buy data. It is a great example for other LSPs of how to monetize the translation data they have.
“We have stories to tell...”
22/12/2020
Translators of high- and low-resource languages now have the means to monetize their translations while contributing to the digital representation of their languages.
Head Start Advantage with Early Adoption
22/12/2020
Joining the Data Marketplace ahead of other LSPs offers many advantages for establishing your company in the language-data-for-AI space.