Diagnosing a Repetitive Generation Loop
A developer is using an autoregressive language model to generate a story. They observe that the model is stuck in a repetitive loop. Based on the generation log below, diagnose the most likely error in the developer's implementation of the generation process. Explain your reasoning.
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An autoregressive language model is generating text. It has been provided the initial context 'The sun is shining and the'. In its first step, it predicts the next word will be 'sky'. To generate the word that comes after 'sky', what sequence will the model use as its new input?
An autoregressive language model is given the initial context 'The old clock ticked'. It then generates the next two tokens, which are the word 'loudly' followed by a period '.'. Arrange the following four actions into the correct chronological sequence that describes this two-step generation process.
Diagnosing a Repetitive Generation Loop