Learn Before
Contrasting Decoding Objectives
Imagine a text generation system has produced several potential output sentences. One decoding strategy selects the single sentence with the highest calculated probability. An alternative strategy selects the sentence that is, on average, most similar to all other high-probability sentences. Contrast these two strategies. Which one is more likely to avoid unusual or outlier generations, and why?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Choosing a Decoding Strategy for a Translation Service
A language model is generating a summary of a document. It produces a set of candidate summaries. Candidate A has the highest individual probability (30%) but is an outlier, sharing very little phrasing with other candidates. Candidates B, C, and D have lower individual probabilities (20% each) but are all very similar to one another, forming a clear consensus. Which of the following statements most accurately predicts the behavior of two different decoding strategies?
Contrasting Decoding Objectives