Predicting Pre-trained Model Behavior
A large language model has been developed with a single training objective: to become highly proficient at predicting the most probable next word in a sequence of text. If a user provides this model with the input, 'Write a short story about a robot who discovers music.', what kind of output is the model most likely to produce? Explain your reasoning based on its training objective.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Instruction Alignment
A user interacts with a large language model that has only undergone its initial training phase on a vast corpus of text, without any subsequent fine-tuning to follow commands. The user provides the input: 'Translate the following sentence into French:'. Which of the following outputs is most characteristic of this specific type of model's behavior?
Diagnosing Language Model Output
Predicting Pre-trained Model Behavior