Learn Before
  • Relative Positional Encoding as a Query-Key Bias

  • Shared Learnable Bias per Offset

T5 Bias for Relative Positional Embedding

The T5 bias, introduced by Raffel et al. (2020), generalizes the idea of offset-specific biases. Assigning a unique learnable parameter to every possible query-key offset limits generalization: any offset not seen during training has no trained parameter. T5 instead groups the offsets into a limited number of 'buckets,' each associated with a single shared learnable parameter. This lets the model handle a wide range of relative positions, including distances longer than any seen during training, since all sufficiently distant offsets fall into the same bucket.
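The bucketing idea can be sketched as follows. This is an illustrative scalar version modeled on the public T5 implementation (where `relative_position` is key position minus query position, and the defaults of 32 buckets and a maximum distance of 128 follow that implementation): nearby offsets each get their own bucket, larger offsets share logarithmically sized buckets, and everything beyond `max_distance` collapses into the last bucket.

```python
import math

def relative_position_bucket(relative_position, bidirectional=True,
                             num_buckets=32, max_distance=128):
    """Map a query-key offset (key_pos - query_pos) to a bucket index.

    Sketch of the T5 scheme: small offsets map one-to-one to buckets,
    larger offsets are binned logarithmically, and offsets beyond
    max_distance all share the final bucket.
    """
    bucket = 0
    n = -relative_position  # distance from key back to query
    if bidirectional:
        # Half the buckets for looking left, half for looking right
        num_buckets //= 2
        if n < 0:
            bucket += num_buckets
            n = -n
    else:
        n = max(n, 0)
    max_exact = num_buckets // 2
    if n < max_exact:
        # Exact bucket per offset for nearby positions
        bucket += n
    else:
        # Logarithmic bucketing for distant positions
        val = max_exact + int(
            math.log(n / max_exact) / math.log(max_distance / max_exact)
            * (num_buckets - max_exact)
        )
        bucket += min(val, num_buckets - 1)
    return bucket
```

Because the last bucket absorbs all offsets past `max_distance`, two offsets the model never saw during training (say, distances of 1000 and 2000) still receive the same, trained bias parameter; this is the generalization advantage the paragraph above describes.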

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related
  • Interpretation of Positional Bias as a Distance Penalty

  • T5 Bias for Relative Positional Embedding

  • Shared Learnable Bias per Offset

  • Heuristic-Based Relative Positional Biases

  • Comparison of Learned vs. Heuristic-Based Relative Positional Biases

  • Kerple

  • FIRE

  • Relative Position Offset Calculation

  • A self-attention model incorporates positional awareness by adding a bias term directly to the query-key dot product for each pair of positions (i, j). This bias term's value depends on the relative distance between i and j. What is the primary implication of this approach compared to the alternative of adding positional vectors to the input token embeddings?

  • Incorporating Positional Bias into Attention Scores

  • In a self-attention mechanism, the score computed between a query at position i and a key at position j is modified by directly adding a bias term whose value depends only on the positions i and j. What is the primary function of this bias term within the attention calculation?

  • Generalization Limit of Offset-Specific Biases

  • Calculating Positional Bias from Offset

  • In a self-attention mechanism that uses a shared, learnable parameter for each unique relative position offset, which of the following query-key pairs will share the exact same positional bias parameter as the pair with a query at position 8 and a key at position 3?

  • Parameter Implications of Offset-Based Positional Bias

Learn After
  • Offset Calculation for T5 Bias

  • Number of Buckets for T5 Bias Terms

  • Learned Parameters for T5 Bias

  • Generalization Advantage of T5 Bias through Parameter Sharing

  • Controlling Overfitting with T5 Bias Buckets

  • Formula for Attention with T5 Bias (Unscaled)

  • Formula for Scaled Attention with T5 Bias

  • Consider a hypothetical self-attention model that uses a relative positional encoding scheme where every unique query-key offset (e.g., -5, -4, ..., 0, ..., 4, 5) is assigned its own distinct, learnable bias parameter. How does the T5 approach, which groups many different offsets into a limited number of 'buckets' that share a single parameter, represent a key improvement over this hypothetical scheme, especially for handling sequences longer than those seen during training?

  • Generalization of Relative Positional Bias

  • Choosing a Positional Encoding Scheme for Generalization

  • You are reviewing a proposal to extend a productio...

  • You’re debugging a long-context retrofit of a pret...

  • Your team is extending a pretrained Transformer fr...

  • Choosing and Justifying a Positional Retrofit Under Long-Context and Latency Constraints

  • Selecting a Positional Strategy for a Long-Context Retrofit

  • Diagnosing Long-Context Failures Across Positional Schemes

  • You’re reviewing three proposed positional mechani...

  • Long-Context Retrofit Decision: RoPE Base Scaling vs ALiBi vs T5 Relative Bias

  • Root-Cause Analysis of Long-Context Degradation After a Positional-Encoding Retrofit

  • Post-Retrofit Regression: Separating Positional-Method Effects from Scaling Choices