Concept

Corrupted Input for Encoder-Decoder Pre-training

When pre-training an encoder-decoder model with BERT-style or denoising autoencoding objectives, the data is first fed into the encoder. The encoder's input is a corrupted token sequence: some tokens are deliberately masked out and replaced with a special placeholder such as [MASK] (or [M] for short).
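
To make the corruption step concrete, below is a minimal Python sketch. It assumes a toy whitespace tokenizer, a fixed masking rate, and an illustrative function name (corrupt); real pre-training pipelines operate on subword token IDs and use more elaborate masking schemes.

```python
import random

MASK = "[MASK]"  # the special placeholder; the text abbreviates it as [M]

def corrupt(tokens, mask_rate=0.15, seed=0):
    """Return a corrupted copy of `tokens` with a random subset of
    positions replaced by the [MASK] placeholder (BERT-style masking)."""
    rng = random.Random(seed)
    corrupted = list(tokens)
    # Mask roughly `mask_rate` of the positions (at least one).
    num_to_mask = max(1, round(mask_rate * len(tokens)))
    for i in rng.sample(range(len(tokens)), num_to_mask):
        corrupted[i] = MASK
    return corrupted

tokens = "the early bird catches the worm".split()
print(corrupt(tokens))  # e.g. ['the', 'early', '[MASK]', 'catches', 'the', 'worm']
```

The corrupted sequence is what the encoder sees; the model is then trained to recover the original tokens at the masked positions (or, in denoising autoencoding, to reconstruct the original sequence with the decoder).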


