General Equivalence of Modified RoPE
A modified Rotary Positional Embedding function, denoted Ro', can be defined through its equivalence to the original Ro function. Applying the modified function Ro' to a token embedding with position parameters is identical to applying the original function Ro to the same embedding but with a transformed set of position parameters, . This relationship is formally stated as:
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Learn After
Formula for RoPE with Linear Positional Interpolation
A researcher defines a new rotary position embedding function,
Ro_new, for a tokenx_iat positioni. The new function is defined asRo_new(x_i, iθ) = Ro(x_i, (i+c)θ), whereRois the original function andcis a constant offset. According to the general equivalence principle, this can be written asRo_new(x_i, iθ) = Ro(x_i, iθ'). What is the correct expression for the transformed position parameteriθ'?RoPE Scaling Transformation Equivalence
Equivalence of RoPE Modification Strategies
Analysis of a Flawed RoPE Modification