Learn Before
Multiple Choice

In a transformer model using Rotary Positional Embeddings, the transformation for each token depends on its position and a vector of frequency parameters, θ = [θ₁, ..., θ_{d/2}], where each component θ_k corresponds to a different 2-dimensional rotation. A researcher proposes a modification where all components of this vector are set to the same value (i.e., θ₁ = θ₂ = ... = θ_{d/2}). What is the most likely consequence of this change on the model's ability to represent positional information?
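To see the consequence concretely, here is a minimal NumPy sketch (not part of the original card; function and variable names are illustrative). It applies the RoPE rotation to each 2-D pair of a vector and compares the standard multi-scale frequencies θ_k = 10000^(−2k/d) against the proposed uniform choice, showing that a shared θ collapses the transform to a single-frequency rotation whose encoding of position repeats with period 2π/θ.

```python
import numpy as np

def rope_rotate(x, pos, thetas):
    """Apply RoPE: rotate each 2-D pair (x[2k], x[2k+1]) by angle pos * thetas[k]."""
    out = np.empty_like(x, dtype=float)
    for k, theta in enumerate(thetas):
        angle = pos * theta
        c, s = np.cos(angle), np.sin(angle)
        x1, x2 = x[2 * k], x[2 * k + 1]
        out[2 * k] = x1 * c - x2 * s
        out[2 * k + 1] = x1 * s + x2 * c
    return out

d = 8
# Standard RoPE: geometrically spaced frequencies, one per 2-D pair.
standard = np.array([10000 ** (-2 * k / d) for k in range(d // 2)])
# Researcher's proposal: every pair shares the same frequency.
uniform = np.full(d // 2, standard[0])

x = np.ones(d)
period = 2 * np.pi / uniform[0]

# With a uniform theta, every pair rotates by the same angle, so the whole
# transform is one global rotation: positions separated by 2*pi/theta map
# to identical embeddings (positions alias).
assert np.allclose(rope_rotate(x, 0.0, uniform),
                   rope_rotate(x, period, uniform))

# With multi-scale thetas, the slower frequencies have not completed a
# cycle at that offset, so the two positions remain distinguishable.
assert not np.allclose(rope_rotate(x, 0.0, standard),
                       rope_rotate(x, period, standard))
```

The sketch illustrates the trade-off the question probes: the geometric spectrum of θ_k values gives each pair a different rotation period, letting the model resolve both short- and long-range relative positions, whereas a single shared θ leaves only one period and makes distinct positions indistinguishable once that period wraps around.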


Updated 2025-09-28


Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science