Formula

Formula for Scaling the Period in Position Interpolation

To adjust the period of the sine and cosine functions in Rotary Positional Embeddings for longer sequences, one approach is to scale up the period TkT_k by a factor of m/mlm/m_l. The adjusted period is given by: Tk=mml2πb2(k1)dT'_k = \frac{m}{m_l} \cdot 2\pi \cdot b^{\frac{2(k-1)}{d}} where mm is the new sequence length, mlm_l is the original sequence length, and bb is the base.

Image 0

0

1

Updated 2026-04-29

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models