Multiple Choice

A researcher is fine-tuning a model for a question-answering task. The model takes a question and a context paragraph as input and predicts the start and end positions of the answer within the paragraph. After training, the researcher observes a specific performance issue: the model consistently identifies the correct end token of the answer span but frequently selects an incorrect start token. Given the typical architecture for this task, in which the start and end positions are predicted separately, which component is the most likely source of this specific error pattern?
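For reference, the setup described in the question can be sketched in a few lines. This is a minimal illustration, not any particular library's implementation. It assumes a BERT-style design in which one learned vector scores each token as a start position and a second, separate vector scores it as an end position. The names `predict_span`, `start_vector`, `end_vector`, and the toy `token_states` are hypothetical and chosen only for this example.

```python
import math


def softmax(scores):
    """Turn a list of raw scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def predict_span(token_states, start_vector, end_vector):
    """Pick the (start, end) pair with the highest joint probability.

    token_states: one hidden-state vector per context token (assumed to
    come from a shared encoder, which is not modeled here).
    start_vector / end_vector: separate learned parameters (hypothetical
    names) that score each token as a start or end position.
    """
    start_probs = softmax([dot(h, start_vector) for h in token_states])
    end_probs = softmax([dot(h, end_vector) for h in token_states])

    best_span, best_score = None, -1.0
    for i, p_start in enumerate(start_probs):
        # Only consider spans whose end is not before their start.
        for j in range(i, len(end_probs)):
            score = p_start * end_probs[j]
            if score > best_score:
                best_span, best_score = (i, j), score
    return best_span, start_probs, end_probs


# Toy example: four tokens with 2-dimensional hidden states.
token_states = [[1.0, 0.0], [0.0, 1.0], [3.0, 0.0], [0.0, 3.0]]
start_vector = [1.0, 0.0]
end_vector = [0.0, 1.0]

span, start_probs, end_probs = predict_span(token_states, start_vector, end_vector)
print(span)
```

In this sketch both distributions are computed from the same token states but with different vectors, so replacing `start_vector` changes the start distribution while leaving the end distribution untouched.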

Updated 2025-09-26

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science