Concept

Input Corruption Methods for Denoising Autoencoder Training

When training encoder-decoder models with a denoising autoencoding objective, various methods can be used to corrupt the input data. This process is crucial for training the model to reconstruct the original input. Besides the common technique of masking tokens, other corruption strategies include altering tokens to different ones or reordering them within the sequence.

0

1

Updated 2026-04-16

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences