Learn Before
Diagnosing a Model's Translation Error
A language translation model is designed to process all words in a sentence at the same time, rather than one after another. When translating the English sentence "The quick brown fox jumps over the lazy dog," it produces the nonsensical German output "Fuchs springt über Hund der faule braune schnelle der." The model correctly translated each word, but the order is completely scrambled. What specific type of information, which must be represented for each word, is this model likely failing to consider? Explain your reasoning.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A sequence processing model is configured to represent all its internal data using 512-dimensional vectors. To understand the order of items, it generates a unique vector for each position in an input sequence. If the model is given a sequence with 40 items, what are the dimensions of the vector that represents the position of the 25th item?
Diagnosing a Model's Translation Error
Analyzing a Model's Order-Insensitivity