Learn Before
Short Answer

Effect of Modifying RoPE Frequency Parameters

Consider two transformer models using Rotary Positional Embeddings (RoPE). Model A uses a standard vector of frequency parameters, θ = [θ₁, ..., θ_{d/2}], to control the rotations. Model B uses a modified vector where every frequency is doubled: θ' = [2θ₁, ..., 2θ_{d/2}]. How does this modification in Model B affect the model's encoding of relative positions between tokens compared to Model A? Explain your reasoning.

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science