A large language model's ability to answer factual questions about history is a direct result of a separate training phase focused specifically on memorizing historical facts, distinct from its primary language modeling task.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A common observation is that large language models, despite being trained only to predict the next token in a sequence, can perform tasks that seem to require genuine world knowledge. What is the primary reason for this emergent capability?
The Emergence of Knowledge from a Simple Objective
A large language model's ability to answer factual questions about history is a direct result of a separate training phase focused specifically on memorizing historical facts, distinct from its primary language modeling task.