Document Rotation as an Input Corruption Method
Document rotation is an input corruption method used in denoising objectives, where the model's task is to identify the original start of a sequence. A token is selected at random from the input text, and the entire sequence is rotated so that this token becomes the new beginning; the tokens that originally preceded it are moved to the end. The model is then trained on this rotated sequence to predict which token was originally first, which requires recovering the document's original order.
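The corruption step described above can be sketched as a small function (the name `rotate_document` and the token-list interface are illustrative, not from any particular library):

```python
import random

def rotate_document(tokens, rng=random):
    """Corrupt a token sequence by rotating it to start at a random token.

    The tokens that originally preceded the chosen pivot are moved to
    the end, so a model trained on the output must learn to identify
    the original starting position.
    """
    pivot = rng.randrange(len(tokens))  # index of the new first token
    return tokens[pivot:] + tokens[:pivot], pivot

tokens = "Hard work leads to success .".split()
rotated, pivot = rotate_document(tokens)
print(rotated, "original start index:", pivot)
```

During pre-training, the pair `(rotated, pivot)` would serve as the corrupted input and the target the model learns to predict.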
References
Reference of Foundations of Large Language Models Course
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Sentence Reordering as an Input Corruption Method
A research team is training a model on multi-paragraph documents. Their primary goal is to ensure the model learns the logical flow and coherence between sentences, not just the relationships between words within a single sentence. Which of the following input corruption strategies is specifically designed to target this higher-level, inter-sentence understanding?
Rationale for Sentence-Level Corruption
A language model is being trained using a denoising objective, where it learns to reconstruct original text from a corrupted version. Match each type of input corruption with the primary linguistic feature it forces the model to learn.
Learn After
Example of Document Rotation
A self-supervised learning task involves modifying an input sequence by selecting a token and rearranging the sequence so that the selected token becomes the new starting point. The part of the sequence that originally came before the selected token is moved to the end. Given the original sequence 'Hard work leads to success .', if the token 'leads' is chosen as the new starting point, what is the resulting modified sequence?
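The transformation asked about here can be verified with a short sketch (the helper `rotate_at` is illustrative; it rotates at the first occurrence of the chosen token):

```python
def rotate_at(tokens, start_token):
    # Rotate so that start_token becomes the first token; the prefix
    # that originally preceded it is appended to the end.
    i = tokens.index(start_token)
    return tokens[i:] + tokens[:i]

tokens = "Hard work leads to success .".split()
print(" ".join(rotate_at(tokens, "leads")))
# → leads to success . Hard work
```

Choosing 'leads' as the pivot moves the prefix 'Hard work' to the end, yielding 'leads to success . Hard work'.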
Reconstructing Original Sequence from Rotated Input
A language model is being trained using a technique where an input document is 'rotated'. For example, an original document is transformed into the following sequence: 'leads to success . Success brings happiness . Hard work'. What is the primary objective for the model when presented with this transformed input?
Example of Document Rotation in Denoising Autoencoding