Similarity of ALiBi Positional Biases to Length Features
The functional form of the right-hand side of the ALiBi (Attention with Linear Biases) equation closely resembles the length features used in conventional feature-based systems. In statistical machine translation, for example, such length features were widely used to model word reordering, yielding models that generalize well across translation tasks.
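To make the functional form concrete, below is a minimal sketch of an ALiBi-style linear bias, -β ⋅ (i - j), added to causal attention scores. It is an illustration, not the reference implementation: the slope β = 0.1 and the 5-token sequence are arbitrary choices (ALiBi itself fixes a per-head slope schedule), and NumPy stands in for a real attention stack.

```python
import numpy as np

def linear_position_bias(seq_len: int, beta: float) -> np.ndarray:
    """Bias matrix with entry (i, j) = -beta * (i - j).

    i indexes query positions, j indexes key positions; more distant
    keys receive a more negative bias.
    """
    i = np.arange(seq_len)[:, None]   # query positions, column vector
    j = np.arange(seq_len)[None, :]   # key positions, row vector
    bias = -beta * (i - j).astype(float)
    bias[j > i] = -np.inf             # causal mask: no attending to the future
    return bias

# Toy attention scores for a 5-token sequence; the bias is added
# to the scores before the softmax, as in ALiBi.
scores = np.random.randn(5, 5)
biased_scores = scores + linear_position_bias(5, beta=0.1)
print(linear_position_bias(5, beta=0.1))
```

Note how this is exactly the shape of a hand-crafted length feature: a single scalar weight β multiplying a query-key distance, with no learned embedding per position.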
Tags
Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Formula for Attention Score with ALiBi Bias
Linear Relative Position Bias Example
In a sequence processing model, a positional bias is calculated to penalize attention scores based on the distance between tokens. The formula used is Bias = -β ⋅ (i - j), where i is the query position, j is the key position, and β is a fixed scalar. If the query token is at position 5, the key token is at position 2, and β = 0.1, what is the calculated bias value?
Visual Example of a Linear Relative Position Bias in Causal Attention
True or False: According to the positional bias formula PE(i, j) = -β ⋅ (i - j), where i is the query position, j is the key position, and β is a positive scalar, the penalty applied to the attention score decreases as the distance between the query and key tokens increases.
Interpreting a Linear Positional Bias Value
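For reference, working the example above through the formula: with i = 5, j = 2, and β = 0.1, Bias = -0.1 ⋅ (5 - 2) = -0.3. Because β is positive, the bias becomes more negative (a larger penalty) as the query-key distance grows.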