1Cademy - Input Corruption Methods for Denoising Autoencoder Training

Input to Encoder: &quot;The scientist carefully [MASK] the solution into the beaker.&quot;
Target Output for Decoder: &quot;The scientist carefully poured the solution into the beaker.&quot;

Learn Before

Training Encoder-Decoder Models with a Denoising Autoencoding Objective

Concept

Input Corruption Methods for Denoising Autoencoder Training

When training encoder-decoder models with a denoising autoencoding objective, various methods can be used to corrupt the input data. This process is crucial for training the model to reconstruct the original input. Besides the common technique of masking tokens, other corruption strategies include altering tokens to different ones or reordering them within the sequence.

Updated 2026-04-16

Contributors are: