Learn Before
Function of a Sequence of Overlined Variables
A function, denoted as , that takes a sequence of variables, , as its input. The notation represents the -th variable in the sequence, where the bar (overline) can signify a specific property, such as being an average or a predicted value.

0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Set of Indexed Key-Value Pairs
Set of Superscript-Indexed Vectors
Set of Key-Value Pairs
Function of a Sequence of Overlined Variables
Function of a Sequence of Averaged Vectors
Vector Slice Notation for a Sequence Window ()
Set of Sequential Vectors Notation
Vector Sequence Window Notation
Consider an autoregressive model generating a sequence of tokens one by one. At each step
i, the model calculates attention using the query from the current token and the keys and values from all tokens generated so far (from position 1 toi). To optimize this process, the model maintains a growing set of all previously computed key and value vectors. What is the primary computational advantage of this strategy?State of an Autoregressive Cache
An autoregressive language model with
τparallel computational units (e.g., attention heads) is generating a sequence of tokens. After computing the output for the 3rd token, the model stores the key and value vectors from all tokens processed so far to use in subsequent steps. Which of the following notations correctly represents the complete set of these stored key-value pairs at this specific moment?
Learn After
In a system that generates a sequence of 'k' items, let each ȳᵢ (for i from 1 to k) represent a summary of the system's state after generating the i-th item. A scoring function is then defined as s(ȳ₁, ..., ȳₖ), which outputs a single numerical value. Based on this structure, what is the primary role of the function 's'?
Interpreting a Sequence Evaluation Function
Evaluating Candidate Text Sequences