Short Answer

Impact of Scaling Factor on RoPE Frequencies

Consider the formula for scaled RoPE frequency parameters:

$$\theta' = \left[(\lambda b)^{-\frac{0}{d}}, (\lambda b)^{-\frac{2}{d}}, \dots, (\lambda b)^{-\frac{d-2}{d}}\right]$$

Explain how decreasing the scaling factor λ (where 0 < λ < 1) affects the individual frequency components in the vector θ'. What is the corresponding effect on the rotational periods of the embeddings?
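One way to check an answer numerically is to evaluate the formula for a few values of λ. The sketch below is only an illustration: the function names `scaled_rope_frequencies` and `rotation_periods`, the base b = 10000, and the dimension d = 8 are assumptions chosen for the example and do not come from the question. It also assumes the usual convention that a component with frequency θ completes one full rotation every 2π/θ positions.

```python
import math


def scaled_rope_frequencies(base, dim, lam):
    """Return [(lam*base)^(-0/d), (lam*base)^(-2/d), ..., (lam*base)^(-(d-2)/d)].

    Hypothetical helper for illustration; `dim` must be even.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    return [(lam * base) ** (-(2 * i) / dim) for i in range(dim // 2)]


def rotation_periods(freqs):
    """Positions needed for one full 2*pi rotation at each frequency."""
    return [2 * math.pi / f for f in freqs]


# Assumed example values: b = 10000, d = 8.
for lam in (1.0, 0.5, 0.25):
    freqs = scaled_rope_frequencies(10000.0, 8, lam)
    periods = rotation_periods(freqs)
    print(f"lambda={lam}")
    print("  frequencies:", [round(f, 5) for f in freqs])
    print("  periods:    ", [round(p, 2) for p in periods])
```

Comparing the printed rows across λ values shows, component by component, how the frequencies and periods move as λ shrinks. The first component has exponent 0, so it is worth looking at separately from the rest.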

Updated 2025-10-08

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science