Diagnosing Language Model Output
Instead of providing a summary, the model generates the following text: '...which consists of the Sun and the objects that orbit it, including eight planets and their moons.' Based on the model's described training, explain the fundamental reason for the discrepancy between the developer's instruction and the model's actual output.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Instruction Alignment
A user interacts with a large language model that has only undergone its initial training phase on a vast corpus of text, without any subsequent fine-tuning to follow commands. The user provides the input: 'Translate the following sentence into French:'. Which of the following outputs is most characteristic of this specific type of model's behavior?
Predicting Pre-trained Model Behavior