Formula

Formula for Scaled RoPE Frequency Parameters (θ')

When adapting Rotary Positional Embeddings (RoPE) for different sequence lengths, a new vector of frequency parameters, $\theta'$, is defined. The components of this vector are calculated using a scaling factor $\lambda$, a base term $b$, and the embedding dimension $d$. The formula for the vector is:

$$\theta' = \left[(\lambda b)^{-\frac{0}{d}},\ (\lambda b)^{-\frac{2}{d}},\ \dots,\ (\lambda b)^{-\frac{d-2}{d}}\right]$$
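The formula can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name `scaled_rope_frequencies` and its parameter names are hypothetical, and the default base of 10000 is an assumption (a commonly used value) that the text above does not specify.

```python
def scaled_rope_frequencies(d, base=10000.0, scale=1.0):
    """Return the vector theta' = [(scale * base) ** (-2i / d)] for i = 0 .. d/2 - 1.

    d     -- embedding dimension (assumed even, so the exponents run 0, 2, ..., d - 2)
    base  -- the base term b (default 10000 is an assumption, not from the source)
    scale -- the scaling factor lambda; scale = 1 leaves the base unchanged
    """
    if d <= 0 or d % 2 != 0:
        raise ValueError("d must be a positive even integer")
    scaled_base = scale * base
    # One frequency per pair of dimensions: exponents -0/d, -2/d, ..., -(d-2)/d.
    return [scaled_base ** (-(2 * i) / d) for i in range(d // 2)]


if __name__ == "__main__":
    # Compare unscaled and scaled frequencies for a small dimension.
    print(scaled_rope_frequencies(8))
    print(scaled_rope_frequencies(8, scale=4.0))
```

The first component is always 1, since its exponent is 0. For every other component, a scaling factor greater than 1 enlarges the base and therefore lowers the frequency.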

Updated 2026-04-29

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models