Learn Before
Uniqueness of RoPE-based Embeddings
A language model generates a final, position-aware embedding, x′, by applying a rotational transformation to a token's initial embedding, x, based on its position, p. The process is described by the function x′ = f(x, p). If two different tokens (with distinct initial embeddings x₁ and x₂) are located at the same position p, is it possible for them to have identical final embeddings (i.e., f(x₁, p) = f(x₂, p))? Explain your reasoning based on the properties of a rotational transformation.
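The key property is that a rotation matrix is orthogonal and therefore invertible, so distinct embeddings at the same position can never be mapped to the same output. A minimal 2D sketch of this reasoning, assuming a single hypothetical frequency theta = 0.5 (actual RoPE uses a vector of frequencies across 2×2 blocks):

```python
import numpy as np

def rope_rotate(x, pos, theta=0.5):
    """Rotate a 2D embedding x by the position-dependent angle pos * theta."""
    angle = pos * theta
    R = np.array([[np.cos(angle), -np.sin(angle)],
                  [np.sin(angle),  np.cos(angle)]])
    return R @ x

p = 7
x1 = np.array([1.0, 0.0])   # two distinct initial embeddings
x2 = np.array([0.0, 1.0])
y1 = rope_rotate(x1, p)     # final embeddings at the same position p
y2 = rope_rotate(x2, p)

# The rotation is invertible (orthogonal matrix), so distinct inputs stay distinct:
assert not np.allclose(y1, y2)
# Rotating back by -p recovers the original embedding exactly:
assert np.allclose(rope_rotate(y1, -p), x1)
```

Because f(·, p) has an inverse (rotate by the opposite angle), f(x₁, p) = f(x₂, p) would force x₁ = x₂, contradicting the assumption that the embeddings are distinct.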
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.3 Prompting - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Application of RoPE Rotation to a 2D Vector
RoPE Frequency Parameters
Definition of the 2x2 RoPE Rotation Matrix Block
RoPE Parameter Vector Definition
Definition of RoPE Parameter Vector (θ)
A language model encodes token positions by applying a unique, position-dependent rotational transformation to each token's initial embedding. The final, position-aware embedding for a token is the result of this transformation. If the exact same token (e.g., 'model') appears at position 4 and later at position 12 in a sequence, which statement best describes the relationship between their final embeddings, f(x, 4) and f(x, 12)?
RoPE 2D Vector Rotation Formula
Formula for RoPE-Encoded Token Embedding
Uniqueness of RoPE-based Embeddings
Debugging a RoPE Implementation
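The related question about the same token at positions 4 and 12 can be sketched numerically as well: the two final embeddings keep the original vector's norm (rotations are length-preserving) but point in different directions whenever the position offset does not correspond to a full multiple of 2π. A hypothetical 2D example with an assumed single frequency theta = 0.5:

```python
import numpy as np

def rope_rotate(x, pos, theta=0.5):
    """Rotate a 2D embedding x by the position-dependent angle pos * theta."""
    angle = pos * theta
    c, s = np.cos(angle), np.sin(angle)
    return np.array([c * x[0] - s * x[1],
                     s * x[0] + c * x[1]])

x = np.array([0.8, 0.6])       # hypothetical initial embedding of the token 'model'
e4  = rope_rotate(x, 4)        # same token at position 4 ...
e12 = rope_rotate(x, 12)       # ... and at position 12

# Different positions give different rotation angles, so the embeddings differ:
assert not np.allclose(e4, e12)
# But a rotation never changes the norm, so both keep the original length:
assert np.isclose(np.linalg.norm(e4), np.linalg.norm(x))
```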