Learn Before
  • Rotary Positional Embeddings

Application of RoPE to Token Embeddings

The final embedding for a token at position $i$, denoted $\mathbf{e}_i$, is obtained by applying the Rotary Positional Embedding (RoPE) transformation to the token's original embedding $\mathbf{x}_i$. This is represented by the function $\mathrm{Ro}(\mathbf{x}_i, i\theta)$, where $i$ is the position and $\theta$ represents the rotational frequency parameters. The formula is: $\mathbf{e}_i = \mathrm{Ro}(\mathbf{x}_i, i\theta)$
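
A minimal NumPy sketch of this transformation, assuming the common RoPE convention of rotating consecutive coordinate pairs with frequencies $\theta_k = 10000^{-2k/d}$ (the function name `rope`, the base 10000, and the pairing layout are illustrative assumptions, not specified by this card):

```python
import numpy as np

def rope(x: np.ndarray, i: int, base: float = 10000.0) -> np.ndarray:
    """Apply e_i = Ro(x_i, i*theta): rotate each 2D pair of x by angle i*theta_k."""
    d = x.shape[-1]
    assert d % 2 == 0, "RoPE pairs up dimensions, so d must be even"
    # One frequency per 2D pair (assumed schedule): theta_k = base^(-2k/d)
    theta = base ** (-2.0 * np.arange(d // 2) / d)
    angles = i * theta                    # rotation angle i*theta_k for pair k
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[..., 0::2], x[..., 1::2]   # split into (even, odd) coordinates
    e = np.empty_like(x)
    e[..., 0::2] = x1 * cos - x2 * sin    # 2x2 rotation applied to each pair
    e[..., 1::2] = x1 * sin + x2 * cos
    return e

# e_i = Ro(x_i, i*theta) for a token embedding x_i at position i = 5
x_i = np.random.randn(8)
e_i = rope(x_i, i=5)
```

Because each pair is rotated by an angle proportional to the position $i$, the angle between two tokens' rotated embeddings depends only on their positional difference, which is how RoPE encodes relative position.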


Related
  • Comparison of Rotary and Sinusoidal Embeddings

  • Conceptual Illustration of RoPE's Rotational Mechanism

  • Example of RoPE Capturing Relative Positional Information

  • Application of RoPE to d-dimensional Embeddings

  • Application of RoPE to Token Embeddings

  • RoPE as a Linear Combination of Periodic Functions

Learn After
  • Application of RoPE Rotation to a 2D Vector

  • RoPE Frequency Parameters

  • Definition of the 2x2 RoPE Rotation Matrix Block

  • RoPE Parameter Vector Definition