Short Answer

Choosing a Training Objective for Error Detection

A researcher wants to train a language model to be highly effective at identifying the specific location of grammatical errors within long sentences. They are considering two self-supervised training objectives:

Objective 1: The model reads an entire sentence and outputs a single label: 'grammatically correct' or 'contains an error'.

Objective 2: The model reads an entire sentence and, for each individual word, outputs a label: 'correct' or 'incorrect'.
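To make the two label formats concrete, here is a minimal Python sketch of how training targets for each objective might be built from the same sentence. It assumes, purely for illustration, that errors are created synthetically by replacing one word, and that any replaced word counts as an error. The helper names (`corrupt`, `sentence_label`, `word_labels`) and the small replacement vocabulary are hypothetical and do not come from the question.

```python
import random

def corrupt(words, vocab, rng):
    """Replace one randomly chosen word (assumed way of injecting an error).

    Returns the corrupted word list and the index that was changed.
    """
    i = rng.randrange(len(words))
    choices = [w for w in vocab if w != words[i]]
    corrupted = list(words)
    corrupted[i] = rng.choice(choices)
    return corrupted, i

def sentence_label(changed_index):
    """Objective 1: one label for the whole sentence."""
    if changed_index is None:
        return "grammatically correct"
    return "contains an error"

def word_labels(n_words, changed_index):
    """Objective 2: one label per word."""
    return ["incorrect" if j == changed_index else "correct"
            for j in range(n_words)]

rng = random.Random(0)
words = "the cat sits on the mat".split()
vocab = ["sit", "cats", "a", "in"]  # hypothetical replacement words

corrupted, idx = corrupt(words, vocab, rng)
obj1 = sentence_label(idx)                 # a single string
obj2 = word_labels(len(corrupted), idx)    # a list with one entry per word

print(corrupted)
print("Objective 1 target:", obj1)
print("Objective 2 target:", obj2)
```

In this sketch the Objective 1 target is a single value per sentence, while the Objective 2 target has as many entries as the sentence has words.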

Which objective is more suitable for the researcher's goal? Justify your answer by explaining the difference in the supervision signal provided by each approach.

Updated 2025-10-03
