1Cademy - A researcher is fine-tuning a model for a question-answering task. The model processes a question and a context paragraph to predict the start and end positions of the answer within the paragraph. After training, the researcher observes a specific performance issue: the model consistently identifies the correct *end* token of the answer span, but frequently selects an incorrect *start* token. Based on the typical architecture for this task where separate predictions are made for the start and en

Learn Before

Illustration of BERT-based Architecture for Span Prediction

Multiple Choice

A researcher is fine-tuning a model for a question-answering task. The model processes a question and a context paragraph to predict the start and end positions of the answer within the paragraph. After training, the researcher observes a specific performance issue: the model consistently identifies the correct end token of the answer span, but frequently selects an incorrect start token. Based on the typical architecture for this task where separate predictions are made for the start and en

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related