Concept

Span-Based Denoising as an Encoder-Decoder Training Objective

In a span-based denoising task for an encoder-decoder model, the training objective is to reconstruct only the original text from masked spans. The model's encoder processes an input sequence where one or more spans of text have been replaced by unique mask or sentinel tokens. The decoder is then trained to generate a sequence containing these sentinel tokens paired with the original text they replaced, effectively learning to 'fill in the blanks'. A loss function is computed by comparing the generated text with the ground-truth masked spans.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences