Learn Before
Evaluating Candidate Text Sequences
A text generation system is evaluating two possible three-word sequences. For each sequence, it has calculated a confidence score for each word, denoted by for the -th word. The system uses a function, , to calculate a total score for the entire sequence. A higher total score is considered better. Given the scores below, which sequence should the system prefer, and what is its total score? Explain your reasoning.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
In a system that generates a sequence of 'k' items, let each ȳᵢ (for i from 1 to k) represent a summary of the system's state after generating the i-th item. A scoring function is then defined as s(ȳ₁, ..., ȳₖ), which outputs a single numerical value. Based on this structure, what is the primary role of the function 's'?
Interpreting a Sequence Evaluation Function
Evaluating Candidate Text Sequences