Learn Before
Kerple Logarithmic Bias Formula
The Kerple method for positional bias can be implemented using a logarithmic function to penalize attention based on token distance. For a query at position i and a key at position j, the bias is calculated with the formula: bias = -r1 · log(1 + r2 · |i - j|). Here, r1 and r2 are positive hyperparameters controlling the scale and shape of the logarithmic penalty.
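The formula above can be sketched numerically. A minimal NumPy sketch, assuming r1 and r2 are given fixed illustrative values (in practice they are learned or tuned hyperparameters):

```python
import numpy as np

def kerple_log_bias(seq_len, r1=1.0, r2=1.0):
    """Sketch of the Kerple logarithmic bias matrix.

    r1, r2 are the positive hyperparameters from the formula;
    the defaults here are illustrative placeholders, not values
    from the original method.
    """
    pos = np.arange(seq_len)
    dist = np.abs(pos[:, None] - pos[None, :])   # |i - j| for every query/key pair
    return -r1 * np.log(1.0 + r2 * dist)          # penalty grows with distance

bias = kerple_log_bias(4)
# The bias matrix would be added to the attention logits before softmax,
# so more distant key positions receive a larger (more negative) penalty.
```

Note that the bias is zero on the diagonal (a token attending to its own position, |i - j| = 0) and grows only logarithmically with distance, which penalizes far-away tokens more gently than a linear scheme would.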
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Kerple Positional Bias Formula
Kerple Logarithmic Bias Formula
Sandwich Method (Chi et al., 2023)
Formula for Relative Position Scaled by Sinusoidal Wavelength
A transformer model incorporates a positional bias mechanism where a penalty is applied to the attention score between a query and a key. This penalty grows larger as the distance between the query's position and the key's position in the sequence increases. Given the sentence 'The quick brown fox jumps over the lazy dog', which of the following query-key pairs would receive the smallest penalty from this mechanism?
Comparing Positional Bias Functions
A self-attention mechanism is modified to include a bias term that systematically penalizes attention scores between pairs of tokens. The magnitude of this penalty increases as the distance between the tokens' positions in the sequence grows. For which of the following tasks would this modification be most likely to hinder the model's performance?
Learn After
A model designer implements a positional bias using the formula Bias = -β · log(1 + β), where β is a positive value that increases with token distance. The goal is to penalize attention to more distant tokens. By mistake, the designer forgets the leading negative sign, implementing the formula as Bias = β · log(1 + β). What is the most likely effect of this error on the model's behavior?
Analysis of Positional Penalty Growth
Selecting a Positional Bias Strategy