Token Deletion as an Input Corruption Method
Token deletion is an input corruption technique in which tokens are randomly selected from an input sequence and removed entirely. It differs from token masking in that the selected tokens are not replaced with a special [MASK] symbol but are deleted outright, so the corrupted sequence is shorter than the original and the model must infer not only what is missing but also where.
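A minimal sketch of this corruption step (function name and deletion probability are illustrative, not from the course):

```python
import random

def delete_tokens(tokens, deletion_prob=0.15, seed=0):
    """Randomly drop tokens from a sequence.

    Unlike token masking, deleted positions leave no [MASK]
    placeholder, so the corrupted output is shorter than the input.
    """
    rng = random.Random(seed)
    return [tok for tok in tokens if rng.random() >= deletion_prob]

original = "The quick brown fox jumps over the lazy dog .".split()
corrupted = delete_tokens(original, deletion_prob=0.3)
print(corrupted)  # a shorter sequence; the model is trained to restore the original
```

During denoising pre-training, the model receives the corrupted sequence as input and is trained to reconstruct the original sequence as its target.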
References
Reference of Foundations of Large Language Models Course
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Token Masking as an Input Corruption Method
Token Deletion as an Input Corruption Method
Combining Multiple Corruption Methods in Pre-training
Selecting Appropriate Input Corruption Methods
Token Alteration as an Input Corruption Method
Token Reordering as an Input Corruption Method
Input Corruption Methods for Multi-Sentence Sequences
Corruption Methods for Multi-Sentence Sequences
A research team is pre-training an encoder-decoder model using a denoising objective. Their primary goal is to create a model that excels at summarizing long documents, which requires a deep understanding of the text's overall semantic content and logical flow, rather than its exact word-for-word structure. Which of the following input corruption strategies would be most aligned with this specific goal?
You are training an encoder-decoder model with a denoising objective. Match each input corruption method with the primary linguistic capability it is designed to teach the model.
Diagnosing Pre-training Deficiencies
Learn After
Example Comparison of Token Masking and Token Deletion
A language model is being trained to reconstruct an original text sequence from a corrupted version. During one training step, the original input is 'The quick brown fox jumps over the lazy dog.' and the corrupted input given to the model is 'The quick fox over the lazy dog.'. Based on this example, which specific input corruption technique was applied?
Analysis of Input Corruption Impact
When applying the token deletion method to corrupt an input sequence for model training, the length of the resulting sequence is identical to the original sequence.
Example of Token Deletion in Denoising Autoencoding