Learn Before
Local vs. Global Optimality in Text Generation
A text generation model builds its output by selecting the single most probable word at each step. Explain why this step-by-step approach is not guaranteed to find the single output sequence with the highest overall probability.
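A small numeric example can make the gap between step-wise and sequence-level choices concrete. The sketch below assumes a hypothetical two-step model over a toy vocabulary; the tables `FIRST` and `SECOND` and all probabilities in them are made up for illustration and do not come from any real model. It compares picking the most probable word at each step with scoring every complete sequence.

```python
from itertools import product

# Hypothetical toy model (illustrative numbers only):
# FIRST[w1] is P(w1); SECOND[w1][w2] is P(w2 | w1).
FIRST = {"A": 0.6, "B": 0.4}
SECOND = {
    "A": {"x": 0.55, "y": 0.45},
    "B": {"x": 0.9, "y": 0.1},
}


def sequence_prob(w1, w2):
    """Joint probability of the two-word sequence via the chain rule."""
    return FIRST[w1] * SECOND[w1][w2]


def greedy_decode():
    """Pick the most probable word at each step, never revisiting a choice."""
    w1 = max(FIRST, key=FIRST.get)
    w2 = max(SECOND[w1], key=SECOND[w1].get)
    return (w1, w2)


def exhaustive_decode():
    """Score every complete sequence and return the most probable one."""
    candidates = product(FIRST, ["x", "y"])
    return max(candidates, key=lambda seq: sequence_prob(*seq))


greedy = greedy_decode()
best = exhaustive_decode()
print("greedy:    ", greedy, sequence_prob(*greedy))
print("exhaustive:", best, sequence_prob(*best))
```

In this toy setup, the greedy procedure commits to "A" because it is the more probable first word, yet the sequence starting with the less probable "B" ends up with the higher joint probability, since "B" is followed by a much more confident second step. Exhaustive scoring only works here because there are four candidate sequences; the number of candidates grows exponentially with sequence length.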
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is tasked with generating a 50-word summary. The stated goal is to find the single sequence of 50 words that has the highest overall probability given the input text. Why is it computationally infeasible to guarantee finding this exact sequence in practice?
Selecting a Text Generation Objective
Local vs. Global Optimality in Text Generation