Multiple Choice

An AI development team uses a two-stage system for a text generation task. First, a base generator creates a list of 10 possible outputs. Second, a separate scoring component reranks these 10 outputs to select the best one. The team investigates a case where the system produced a poor final output and makes the following observations:

  1. The final output selected by the scoring component was nonsensical.
  2. A manual review of the initial 10 generated outputs reveals that one of them was a high-quality, correct response.
  3. The scoring component assigned a very low score to this high-quality response.

Based on these observations, what is the most likely source of the system's failure in this specific case?
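
For reference, the two-stage setup described in the question can be sketched roughly as follows. This is only an illustrative sketch, not the team's actual system: `generate_and_rerank`, `toy_generate`, and `toy_score` are hypothetical names, and the toy stand-ins simply make the example runnable. The function returns every candidate with its score alongside the final selection, so the candidate pool and the scores can be inspected separately.

```python
from typing import Callable, List, Tuple


def generate_and_rerank(
    prompt: str,
    generate: Callable[[str, int], List[str]],
    score: Callable[[str, str], float],
    n: int = 10,
) -> Tuple[str, List[Tuple[str, float]]]:
    """Run a generate-then-rerank pipeline (hypothetical sketch)."""
    # Stage 1: the base generator proposes n candidate outputs.
    candidates = generate(prompt, n)
    # Stage 2: the scoring component assigns a score to each candidate.
    scored = [(c, score(prompt, c)) for c in candidates]
    # The final output is the candidate with the highest score.
    best, _ = max(scored, key=lambda pair: pair[1])
    return best, scored


# Toy stand-ins (assumptions for illustration only).
def toy_generate(prompt: str, n: int) -> List[str]:
    return [f"{prompt} candidate {i}" for i in range(n)]


def toy_score(prompt: str, candidate: str) -> float:
    # Scores a candidate by its trailing index; a real scorer would be a model.
    return float(candidate.split()[-1])


best, scored = generate_and_rerank("q", toy_generate, toy_score)
```

Keeping the full `scored` list, rather than only `best`, is what makes a manual review like the one in the observations possible.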

Updated 2025-10-03

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science