1Cademy

Learn Before

Use of Reranking to Explore Model and Search Errors

Multiple Choice

An AI development team uses a two-stage system for a text generation task. First, a base generator creates a list of 10 possible outputs. Second, a separate scoring component reranks these 10 outputs to select the best one. The team investigates a case where the system produced a poor final output and makes the following observations:

1. The final output selected by the scoring component was nonsensical.
2. A manual review of the initial 10 generated outputs reveals that one of them was a high-quality, correct response.
3. The scoring component assigned a very low score to this high-quality response.

Based on these observations, what is the most likely source of the system's failure in this specific case?
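One way to reason about the three observations is to ask two questions in order: did the candidate list contain a good output at all, and if so, did the scorer rank it first? The sketch below is only an illustration of that line of reasoning. The `Candidate` type, the `diagnose` function, the returned labels, and the example scores are all hypothetical and are not part of any system described here; `is_good` stands in for the manual review.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    text: str
    score: float   # score assigned by the scoring component (made-up values)
    is_good: bool  # verdict from manual review (assumed to be available)


def diagnose(candidates: List[Candidate]) -> str:
    """Attribute a failure to one stage of a generate-then-rerank pipeline."""
    # The reranker selects the candidate with the highest score.
    selected = max(candidates, key=lambda c: c.score)
    if selected.is_good:
        return "no failure"
    # A good candidate existed, but the scorer did not prefer it.
    if any(c.is_good for c in candidates):
        return "scoring component"
    # No good candidate was ever produced, so reranking could not help.
    return "base generator"


# Hypothetical case mirroring the observations: nine poor outputs,
# one good output that received a very low score.
candidates = [Candidate(f"poor output {i}", 0.5 + 0.01 * i, False) for i in range(9)]
candidates.append(Candidate("correct response", 0.05, True))
print(diagnose(candidates))
```

The ordering of the checks matters: a good candidate in the list rules out the generator as the cause in that specific case, even if the generator also produced many poor outputs.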


Updated 2025-10-03

