
Learn Before

Tuning the ALiBi Bias Scalar ($\beta$)

Multiple Choice

A research team is fine-tuning a language model for a text summarization task. The model uses a positional encoding scheme where a scalar hyperparameter, β, adjusts the strength of a distance-based bias in the attention mechanism. The team experiments with different values for β and records the model's performance on a validation set using the ROUGE score (higher is better). The results are as follows:

| β Value | ROUGE Score |
|---------|-------------|
| 0.01    | 0.35        |
| 0.1     | 0.4         |
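
To make the role of β concrete, here is a minimal sketch of one way a distance-based attention bias could be scaled by a single scalar. It assumes the bias takes the form −β·|i − j| added to the attention logits before the softmax; the function names `distance_bias` and `attention_weights` are hypothetical, and the sketch omits the per-head slopes and causal masking that a full ALiBi implementation would normally include.

```python
import numpy as np

def distance_bias(seq_len, beta):
    """Return a (seq_len, seq_len) bias matrix equal to -beta * |i - j|.

    Assumption: the bias is linear in token distance and scaled by beta.
    """
    positions = np.arange(seq_len)
    distance = np.abs(positions[:, None] - positions[None, :])
    return -beta * distance

def attention_weights(scores, beta):
    """Add the distance bias to raw attention scores, then apply a softmax."""
    logits = scores + distance_bias(scores.shape[-1], beta)
    # Subtract the row maximum for numerical stability.
    logits = logits - logits.max(axis=-1, keepdims=True)
    weights = np.exp(logits)
    return weights / weights.sum(axis=-1, keepdims=True)

# With uniform raw scores, any difference in the weights comes from beta alone.
scores = np.zeros((4, 4))
weak = attention_weights(scores, beta=0.01)   # bias barely changes the weights
strong = attention_weights(scores, beta=0.1)  # nearby tokens get more weight
```

Under these assumptions, a larger β concentrates each token's attention on its neighbors, while a β close to zero leaves the attention distribution almost unaffected by distance.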

Updated 2025-10-10
