Learn Before
Autoregressive Limitation of GPT
Due to the autoregressive nature of its language modeling objective, the Generative Pre-Training (GPT) model processes text strictly in a forward, left-to-right direction. Consequently, a word's representation is determined solely by the context to its left. For example, in the phrases 'i went to the bank to deposit cash' and 'i went to the bank to sit down', the left context for the word 'bank' is identical in both cases. Because GPT cannot look ahead to the rightward context that differentiates the intended meaning, it returns the exact same representation for 'bank' in both sentences, exposing a critical limitation in its context sensitivity.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
A foundational generative language model introduced in 2018 significantly improved the ability to capture relationships between words far apart in a text, a major challenge for previous sequential models. Which of the following best analyzes the core architectural innovation responsible for this leap in performance?
Critique of an Early Transformer-Based Language Model
Training Objective of an Early Transformer Model
GPT-2
Similarities Between BERT and GPT in Fine-Tuning
Autoregressive Limitation of GPT