Short Answer

Analysis of Denoising Training Objectives

An encoder-decoder model is trained on a denoising task where corrupted text is provided as input. One training approach requires the decoder to reconstruct the entire original, uncorrupted text. A second, 'span-based' approach requires the decoder to generate only the text that was masked in the input, preceded by special tokens indicating which mask is being filled. Explain the key difference in the target sequence length for the decoder in the span-based approach and identify a primary computational benefit of this method.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science