Formula

Offset Calculation for T5 Bias

Within the T5 relative positional embedding framework, the directional distance between a query vector $\mathbf{q}_i$ at index $i$ and a key vector $\mathbf{k}_j$ at index $j$ is measured by their offset. This query-key offset, denoted $d(i, j)$, is calculated by subtracting the key's position from the query's position:

$$d(i, j) = i - j$$
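As a small illustration of the subtraction above, the following sketch computes the offset for every query-key pair in a short sequence. The function names `offset` and `offset_matrix` are hypothetical, and 0-based indexing is an assumption made here; neither comes from the T5 codebase.

```python
def offset(i: int, j: int) -> int:
    """Query-key offset d(i, j) = i - j (query position minus key position)."""
    return i - j


def offset_matrix(num_queries: int, num_keys: int) -> list[list[int]]:
    """Row i, column j holds d(i, j); indices are assumed to start at 0."""
    return [[offset(i, j) for j in range(num_keys)] for i in range(num_queries)]


# Offsets for a 3-token sequence attending to itself.
matrix = offset_matrix(3, 3)
for row in matrix:
    print(row)
# prints:
# [0, -1, -2]
# [1, 0, -1]
# [2, 1, 0]
```

The sign carries the direction: the offset is positive when the key precedes the query, zero when both share a position, and negative when the key follows the query.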

Updated 2026-04-23

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
