icon epic arrow
icon epic arrow
TAUS Quality Estimation Benchmarking Report
March 2025

This report presents the results of a benchmarking exercise, outlining our testing methodology, the training process, and all the key insights gained from analyzing the correlation between QE scores and human evaluations.

icon epic arrow

Contents of the Report

The Setup - Defining the Benchmarking Framework and Evaluation Process

The Results - Correlation between QE Scores and Human Labels

Key Takeaways - Strengths & Limitations

Case Studies

icon epic arrow
Authors
David Koot
ML Engineer | TAUS
icons-social-media-linked-in-circle
Anne-Maj van der Meer
Head of Sales & Marketing | TAUS
icons-social-media-linked-in-circle