Learn Before
A language model is designed using a feedforward network architecture. It is trained to predict the next word by looking at a fixed-size window of the N preceding words (e.g., N=4). What is the most significant architectural limitation of this approach for modeling language?
0
1
Tags
Data Science
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Advantages vs. Disadvantages
Mini example
Bengio et al. (2003) Feed-Forward Neural Language Model
A language model is designed using a feedforward network architecture. It is trained to predict the next word by looking at a fixed-size window of the N preceding words (e.g., N=4). What is the most significant architectural limitation of this approach for modeling language?
Consider a feedforward neural network designed to predict the next word based on a fixed window of the three preceding words. Arrange the following computational steps in the correct order, from initial input to final output.
Function of Word Vector Representations