1Cademy - A model designer implements a positional bias using the formula `Bias = -βlog(1 + β)`, where `β` is a positive value that increases with token distance. The goal is to penalize attention to more distant tokens. By mistake, the designer forgets the leading negative sign, implementing the formula as `Bias = βlog(1 + β)`. What is the most likely effect of this error on the models behavior?

Learn Before

Kerple Logarithmic Bias Formula

Multiple Choice

A model designer implements a positional bias using the formula Bias = -βlog(1 + β), where β is a positive value that increases with token distance. The goal is to penalize attention to more distant tokens. By mistake, the designer forgets the leading negative sign, implementing the formula as Bias = βlog(1 + β). What is the most likely effect of this error on the model's behavior?

Updated 2025-10-04

Contributors are:

Who are from:

Learn Before

Related