Generalizable Positional Embeddings
To overcome the limitations of fixed-length training, an alternative approach is to develop generalizable positional embeddings. Suppose an embedding model is trained on sequences with a maximum length of $n_{\max}$. If the model can generalize, it can be applied to handle much longer sequences of length $n$ (where $n > n_{\max}$) during inference. This capability allows the model to extrapolate and effectively deal with new positions outside the range observed in the training data.
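As a concrete illustration, below is a minimal sketch of one classic generalizable scheme, the sinusoidal encoding of Vaswani et al. (2017). Because each embedding is a closed-form function of the position index rather than a learned table entry, it can be evaluated at any position, including positions beyond the training length. The function name `sinusoidal_positions` and the choice `d_model=512` are illustrative assumptions, not taken from the text.

```python
import numpy as np

def sinusoidal_positions(num_positions: int, d_model: int) -> np.ndarray:
    """Sinusoidal encodings for positions 0..num_positions-1 (d_model even)."""
    positions = np.arange(num_positions)[:, None]   # shape (num_positions, 1)
    dims = np.arange(0, d_model, 2)[None, :]        # shape (1, d_model // 2)
    angles = positions / np.power(10000.0, dims / d_model)

    pe = np.zeros((num_positions, d_model))
    pe[:, 0::2] = np.sin(angles)                    # even dimensions
    pe[:, 1::2] = np.cos(angles)                    # odd dimensions
    return pe

# Encodings are defined for any position index, so a model trained on
# length-1024 inputs can still be handed encodings for a 1500-token document:
pe_short = sinusoidal_positions(1024, d_model=512)
pe_long = sinusoidal_positions(1500, d_model=512)
assert np.allclose(pe_long[:1024], pe_short)        # identical on seen positions
```

Whether the model actually *uses* these extra positions well is a separate question; the point here is only that the encoding itself imposes no hard length limit.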
Tags
Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Classification of Generalization Approaches for Positional Embeddings
Positional Encoding without Generalization
A team trains a language model using an architecture where a unique vector is learned for every possible token position. The entire training dataset consists of texts that are no longer than 1,024 tokens. After training, the model shows excellent performance on all evaluation texts that are 1,024 tokens or shorter. However, when deployed to process a new, 1,500-token document, the model's ability to understand relationships between words degrades dramatically, particularly for words appearing after the 1,024th position. Which of the following is the most direct cause of this performance drop? (A code sketch of this failure mode follows this list.)
Explaining Extrapolation Failure in Positional Embeddings
Evaluating a Flawed Generalization Strategy
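The scenario in the question above can be reproduced with a learned absolute embedding table, which stores one trained vector per position index and has nothing to return for positions it never saw. This is a hypothetical sketch: `position_table` and `embed_positions` are illustrative names, and random vectors stand in for trained ones.

```python
import numpy as np

rng = np.random.default_rng(0)
max_train_len, d_model = 1024, 512

# Learned absolute positional embeddings: one vector per position index.
# Only rows 0..1023 exist, because no longer sequence ever appeared in training.
position_table = rng.normal(size=(max_train_len, d_model))

def embed_positions(seq_len: int) -> np.ndarray:
    # Looks up one row per position; fails when seq_len > max_train_len,
    # since there is simply no learned vector for positions 1024 and beyond.
    return position_table[np.arange(seq_len)]

embed_positions(1024)       # fine: every position has a trained vector
try:
    embed_positions(1500)   # positions 1024..1499 have no entries in the table
except IndexError as err:
    print("Extrapolation failure:", err)
```

Implementations sometimes avoid the hard error by clipping or padding the table, but the underlying problem is the same: positions past the training maximum were never learned, which is exactly why such embeddings do not generalize.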