Learn Before
A sequence-to-sequence model is translating a sentence from one language to another. At the third step of generating the output, it incorrectly predicts a word. Subsequent words generated by the model are also incorrect, leading to a final translation that is grammatically flawed and semantically nonsensical, even though the initial error was minor. Which characteristic of the generation process is the most likely cause of this cascading failure?
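The scenario can be reproduced with a toy decoder. This is a minimal sketch under stated assumptions, not a real translation model: the lookup table `NEXT_TOKEN`, the function `greedy_decode`, and its `forced` parameter are all hypothetical names introduced for illustration. The table stands in for a trained decoder, and each output token is chosen only from the token generated just before it.

```python
# Hypothetical stand-in for a trained decoder: maps the previously
# generated token to the next token the toy "model" would predict.
NEXT_TOKEN = {
    "<s>": "the",
    "the": "cat",
    "cat": "sat",
    "sat": "on",
    "on": "a",
    "a": "mat",
    "mat": "</s>",
    # Continuations the toy model follows once it is off the intended path.
    "ran": "away",
    "away": "quickly",
    "quickly": "</s>",
}


def greedy_decode(forced=None, max_len=10):
    """Generate tokens one at a time, feeding each output back as input.

    forced: optional dict mapping a 1-based step index to a token that is
    emitted at that step regardless of the table, simulating a misprediction.
    """
    forced = forced or {}
    prev, output = "<s>", []
    for step in range(1, max_len + 1):
        # The next token depends on the model's own previous output.
        token = forced.get(step, NEXT_TOKEN[prev])
        if token == "</s>":
            break
        output.append(token)
        prev = token
    return output


clean = greedy_decode()
corrupted = greedy_decode(forced={3: "ran"})  # single error at step 3

print("clean:    ", " ".join(clean))
print("corrupted:", " ".join(corrupted))
```

In this sketch only step 3 is altered, yet every later token also differs from the clean output, because each step reads the previously generated token rather than the intended one.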
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Diagnosing Long-Range Dependency Issues
Trade-offs in Translation Generation