Concept

Document Rotation as an Input Corruption Method

Document rotation is an input corruption method where the primary objective is for a model to identify the original start of a sequence. The process begins by randomly selecting a token from the input text. The entire sequence is then rotated so that this selected token is positioned at the beginning, creating a corrupted version. The model is then trained on this rotated sequence to predict which token was originally the first one.

0

1

Updated 2026-04-17

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences