Concept

T5 Bias for Relative Positional Embedding

The T5 bias, introduced by Raffel et al. (2020), is an advanced approach that generalizes the concept of offset-specific biases. To address the generalization problem of assigning a unique parameter to every offset, T5 groups various query-key offsets into a limited number of 'buckets.' Each bucket is then associated with a single, shared learnable parameter, enabling the model to handle a wide range of relative positions, including those not seen during training.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related
Learn After