Formula

RoPE Scaling Transformation Equivalence

The scaling of Rotary Positional Embeddings (RoPE) can be conceptualized as a transformation of the rotation angle. A scaled rotation function, Ro\mathrm{Ro}', applied to an embedding xi\mathbf{x}_i with an original angle θ\theta, is equivalent to applying the original rotation function, Ro\mathrm{Ro}, with a transformed angle θ\theta'. This equivalence is captured by the formula: Ro(xi,iθ)=Ro(xi,iθ)\mathrm{Ro}'(\mathbf{x}_i, i\theta) = \mathrm{Ro}(\mathbf{x}_i, i\theta') This principle demonstrates that adapting RoPE for different sequence lengths is achieved by adjusting the rotation angles applied to the embeddings.

Image 0

0

1

Updated 2026-04-29

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models

Related
Learn After