Formula

Equation for Matching Periods in RoPE Base Scaling

To determine the scaling factor λ\lambda for RoPE base scaling, the period of the last dimension (lowest frequency) in the new model (with scaled base λb\lambda b) is set equal to the period of the linear positional interpolation model. This constraint is expressed by the following equation: 2π(λb)2(d21)d=mml2πb2(d21)d{}2\pi \cdot (\lambda b)^{\frac{2(\frac{d}{2}-1)}{d}} = \frac{m}{m_l} \cdot 2\pi \cdot b^{\frac{2(\frac{d}{2}-1)}{d}} where mm is the new sequence length, mlm_l is the original length, and dd is the embedding dimensionality.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models

Related
Learn After