1Cademy - Positional Encoding

Learn Before

Transformer Encoder Stack

Concept

Positional Encoding

Unlike Recurrent Neural Networks (RNNs), which process tokens sequentially, one at a time, self-attention replaces sequential operations with parallel computation and therefore does not, on its own, preserve the order of the input sequence. To address this order-insensitivity, the dominant approach is to represent sequence order as an additional input associated with each token, called a positional encoding. These encodings can be either learned during training or fixed a priori.
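
As an illustration of a fixed encoding, the sketch below builds the sinusoidal scheme introduced with the original Transformer and adds it to a batch of token embeddings. This is only one possible choice: the function name, the even d_model requirement, the toy sizes, and the random embeddings are assumptions made for this example, and a learned encoding would instead store one trainable vector per position.

```python
import numpy as np


def sinusoidal_positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """Return a (seq_len, d_model) matrix of fixed sinusoidal encodings.

    Assumption for this sketch: d_model is even, so sine and cosine
    values can be interleaved across dimension pairs.
    """
    if d_model % 2 != 0:
        raise ValueError("this sketch assumes an even d_model")

    positions = np.arange(seq_len)[:, np.newaxis]   # shape (seq_len, 1)
    even_dims = np.arange(0, d_model, 2)[np.newaxis, :]  # shape (1, d_model / 2)

    # Each dimension pair uses a different wavelength, from 2*pi upward.
    angles = positions / np.power(10000.0, even_dims / d_model)

    encoding = np.zeros((seq_len, d_model))
    encoding[:, 0::2] = np.sin(angles)  # even dimensions
    encoding[:, 1::2] = np.cos(angles)  # odd dimensions
    return encoding


# Hypothetical toy input: 4 tokens with 8-dimensional embeddings.
rng = np.random.default_rng(0)
token_embeddings = rng.normal(size=(4, 8))

# The positional encoding is added element-wise, so each token's vector
# now depends on both its content and its position in the sequence.
encoder_input = token_embeddings + sinusoidal_positional_encoding(4, 8)
```

Because the encoding depends only on the position index, two identical tokens at different positions receive different input vectors, which gives self-attention the order information it otherwise lacks.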

Updated 2026-06-27
