Learn Before
FIRE Positional Bias Formula
The positional bias in the FIRE method is calculated using the following formula. It involves a function applied to the relative distance , which is then normalized by the function's value at the maximum of a given length (mlen) and the current position i. The entire expression is scaled by a factor :
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
FIRE Positional Bias Formula
A self-attention mechanism is designed so that the positional influence on the attention score between any two tokens depends only on their relative distance, not their absolute locations. For instance, the positional adjustment between the 3rd and 7th tokens is identical to the adjustment between the 23rd and 27th tokens. Which of the following techniques directly implements this principle?
Analyzing the Functional Approach to Positional Bias
An LLM architect is designing a self-attention mechanism where the positional influence between any two tokens is calculated directly as a bias in the attention score. The core design principle is that this bias must be determined by a specific, continuous mathematical function that takes only the relative distance between the tokens as its input. Which of the following implementation strategies directly realizes this design principle?
Learn After
A method for calculating positional bias uses the formula: where
iis the current token's position,jis another token's position,mlenis a fixed length parameter,fis a scaling factor, andψis a monotonically increasing function (i.e., its value increases as its input increases). How does the normalization term in the denominator,ψ(max(mlen, i)), affect the calculated bias values as the current positionigrows significantly larger thanmlen?Positional Bias Calculation
Analyzing Positional Bias Behavior