Debugging Premature Text Generation Termination
Based on the common method models use to signal the natural conclusion of a response, what is the most probable technical reason for this premature termination, and what aspect of the model's training data might have led to this behavior?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A developer is using a text-generation model to complete the sentence: 'The capital of France is'. The model produces the single word 'Paris' and then immediately stops. The developer had configured the generation process to allow for a maximum of 100 new words and is surprised by the short output. Based on how these models are trained to signal completeness, what is the most likely reason the generation process terminated after just one word?
Consequences of Training Data Omissions
Debugging Premature Text Generation Termination