Activity (Process)

Tuning the ALiBi Bias Scalar (β)

In the ALiBi framework, the scalar hyperparameter β sets the magnitude of the positional penalty added to the query-key products. It is generally chosen by tuning on a validation dataset: candidate values are compared, and the one that performs best on the specific task is kept.
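
The tuning procedure can be sketched as a simple grid search. The snippet below is a minimal illustration, not a reference implementation: it assumes the penalty takes the linear form -β·(i - j) for a query at position i and a key at position j ≤ i, with causal masking of future keys. The names `alibi_bias`, `attention_weights`, `tune_beta`, and `val_loss_fn` are hypothetical and introduced here only for illustration.

```python
import numpy as np


def alibi_bias(seq_len, beta):
    """Bias matrix with entries -beta * (i - j) for j <= i (assumed linear form).

    Future positions (j > i) are set to -inf, i.e. causally masked.
    """
    pos = np.arange(seq_len)
    dist = pos[:, None] - pos[None, :]      # i - j
    bias = -beta * dist.astype(float)
    bias[dist < 0] = -np.inf                # mask keys that lie in the future
    return bias


def attention_weights(q, k, beta):
    """Softmax over scaled query-key products plus the positional penalty."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d) + alibi_bias(q.shape[0], beta)
    scores = scores - scores.max(axis=-1, keepdims=True)   # numerical stability
    w = np.exp(scores)
    return w / w.sum(axis=-1, keepdims=True)


def tune_beta(candidates, val_loss_fn):
    """Return the candidate beta with the lowest validation loss.

    val_loss_fn is a hypothetical callable: beta -> loss on held-out data.
    """
    losses = {beta: val_loss_fn(beta) for beta in candidates}
    best = min(losses, key=losses.get)
    return best, losses
```

In practice `val_loss_fn` would train or evaluate the model with the given β and return a task metric such as validation loss; a call might look like `tune_beta([0.05, 0.1, 0.5, 1.0], val_loss_fn)`. A larger β makes attention decay faster with distance, so the best value depends on how much long-range context the task needs.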


Updated 2026-04-24

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences