Learn Before
Comparison

Comparison of Rotary and Sinusoidal Embeddings

Rotary and sinusoidal positional embeddings share several key characteristics, yet differ fundamentally in their application. Both methods use hard-coded, non-learnable values to encode position, and the approach to setting frequency parameters is analogous in both. However, the primary distinction lies in their integration with token embeddings: sinusoidal embeddings are added to the token vectors, while rotary embeddings apply a rotational transformation, which is a multiplicative operation.

Image 0

0

1

Updated 2026-04-29

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.3 Prompting - Foundations of Large Language Models

Related