TAUS Webinar - Optimize Your Training Data

EPIC

Resources

Optimize Your Training Data

23 March 2021

5:00 - 6:00 pm CEST

Like most machine learning applications, to get intelligent results from machine translation tools you need training data.

We have reached the conclusion that more data is not always better. Instead of massive amounts of data, we need high-quality data, clustered for specific domains and content types

Watch recording

Agenda

The ideal query corpus

Tips on training data optimization and evaluation

Where can you find high-quality training data?

Use case presentations by Lilt and SYSTRAN