Comparison

Similarity of ALiBi Positional Biases to Length Features

The right-hand side of the ALiBi (Attention with Linear Biases) equation, a bias that scales linearly with the distance between query and key positions, closely resembles the length features used in conventional feature-based systems. In statistical machine translation, for instance, such length features are widely used to model word reordering, and the resulting models generalize well across different translation tasks.
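To make the linear form concrete, here is a minimal Python sketch of an ALiBi-style bias for a single attention head. The function names `alibi_slopes` and `alibi_bias` are hypothetical. The slope schedule (a geometric sequence starting at 2^(-8/n) for n heads) and the causal masking of future positions are assumptions taken from the common ALiBi setup, not from the text above.

```python
import math

def alibi_slopes(n_heads):
    """Assumed slope schedule: 2^(-8/n), 2^(-16/n), ..., 2^(-8) for n heads."""
    return [2.0 ** (-8.0 * (k + 1) / n_heads) for k in range(n_heads)]

def alibi_bias(seq_len, slope):
    """Bias added to the attention scores of one head.

    Entry (i, j) is -slope * (i - j) for keys at or before the query
    (j <= i), and -inf for future keys (causal masking, an assumption).
    """
    return [
        [-slope * (i - j) if j <= i else -math.inf for j in range(seq_len)]
        for i in range(seq_len)
    ]

slopes = alibi_slopes(8)          # one slope per head
bias = alibi_bias(4, slopes[0])   # bias matrix for the first head
```

Each entry is just a fixed weight multiplied by a distance, which is the same shape as a weighted length or distance feature in a feature-based model: the further a key is from the query, the larger the penalty, with no learned position embedding involved.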

Updated 2026-04-24

Tags

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course