Multiple Choice

In a sequence processing model, a positional bias is calculated to penalize attention scores based on the distance between tokens. The formula used is Bias = -β ⋅ (i - j), where i is the query position, j is the key position, and β is a fixed scalar. If the query token is at position 5, the key token is at position 2, and β = 0.1, what is the calculated bias value?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science