Learn Before
Impact of Incorrect Ground-Truth Labels
A dataset for a translation task contains the following training sample, formatted as 'input → target':
translate English to Spanish: The cat is black → El perro es negro
Analyze the potential negative impact on a model's learning process if it is trained on a significant number of such samples where the target text is an incorrect translation of the input text. What specific incorrect associations might the model learn from this example?
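One way to see the harm concretely: a model trained on such samples is optimized to reproduce the wrong target, so it picks up spurious input-output associations. The toy sketch below (a hypothetical illustration, not part of the course materials) counts word co-occurrences across many copies of the corrupted sample, showing how "cat" becomes tied to "perro" while the correct pairing "cat"/"gato" is never reinforced.

```python
from collections import Counter

# Hypothetical corrupted dataset: the English input "cat" is always
# paired with the Spanish word "perro" (dog) instead of "gato" (cat).
corrupted_samples = [
    ("the cat is black", "el perro es negro"),  # incorrect translation
] * 100

# Count how often each source word co-occurs with each target word;
# co-occurrence statistics like these drive what associations a
# model can learn from the data.
assoc = Counter()
for src, tgt in corrupted_samples:
    for s in src.split():
        for t in tgt.split():
            assoc[(s, t)] += 1

# The spurious association dominates, and the correct one is absent.
print(assoc[("cat", "perro")])  # 100
print(assoc[("cat", "gato")])   # 0
```

With enough such samples, minimizing training loss directly rewards producing the mistranslation, so the model has no signal from which to recover the correct mapping.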
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A text-to-text model is being trained on the following data sample formatted as 'input → output':
summarize: The solar system consists of the Sun and the astronomical objects gravitationally bound to it. Of the eight planets, the four inner terrestrial planets are Mercury, Venus, Earth, and Mars, and the four outer giant planets are Jupiter, Saturn, Uranus, and Neptune. → The solar system has eight planets, divided into inner terrestrial and outer giant groups.
Which part of this sample represents the correct, or ground-truth, label that the model is expected to learn to produce?
Analyzing Training Data Quality
Impact of Incorrect Ground-Truth Labels