Learn Before
A language model encodes token positions by applying a unique, position-dependent rotational transformation to each token's initial embedding. The final, position-aware embedding for a token is the result of this transformation. If the exact same token (e.g., 'model') appears at position 4 and later at position 12 in a sequence, which statement best describes the relationship between their final embeddings, e_4 and e_12?
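The behavior the question probes can be sketched with a minimal 2D rotation. This is an illustrative sketch, not the card's reference implementation: the single frequency `theta=1.0` and the sample embedding values are assumptions (RoPE proper uses a bank of frequencies, one per 2D slice of the embedding).

```python
import math

def rope_rotate_2d(vec, position, theta=1.0):
    """Rotate a 2D embedding slice by the position-dependent angle position*theta.

    This mirrors RoPE's 2x2 rotation block: the rotation angle grows with
    the token's position, so the same initial embedding is transformed
    differently at each position. `theta` is a single illustrative
    frequency, standing in for RoPE's full frequency bank.
    """
    angle = position * theta
    x, y = vec
    return (x * math.cos(angle) - y * math.sin(angle),
            x * math.sin(angle) + y * math.cos(angle))

# Hypothetical initial embedding for the token 'model' (same at both positions).
token_embedding = (0.6, 0.8)
e4 = rope_rotate_2d(token_embedding, 4)
e12 = rope_rotate_2d(token_embedding, 12)

# The two final embeddings differ in direction but, because rotation is
# length-preserving, they have the identical norm.
assert e4 != e12
assert math.isclose(math.hypot(*e4), math.hypot(*e12))
```

The assertions capture the intended answer pattern: the same token at positions 4 and 12 yields different final embeddings, yet the rotation preserves the embedding's magnitude.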
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.3 Prompting - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Application of RoPE Rotation to a 2D Vector
RoPE Frequency Parameters
Definition of the 2x2 RoPE Rotation Matrix Block
RoPE Parameter Vector Definition
Definition of RoPE Parameter Vector (θ)
RoPE 2D Vector Rotation Formula
Formula for RoPE-Encoded Token Embedding
Uniqueness of RoPE-based Embeddings
Debugging a RoPE Implementation