Learn Before
A team fine-tunes a pre-trained language model on a dataset consisting of 50,000 examples. Every example in the dataset follows the same format: an instruction to 'Summarize the following article,' followed by a long news article and a professionally written one-paragraph summary. After training, the model is tested with a new instruction: 'Summarize the following article,' followed by a news article it has never seen before. Which outcome would best demonstrate that the model has successfully generalized from its training?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A team fine-tunes a pre-trained language model on a dataset consisting of 50,000 examples. Every example in the dataset follows the same format: an instruction to 'Summarize the following article,' followed by a long news article and a professionally written one-paragraph summary. After training, the model is tested with a new instruction: 'Summarize the following article,' followed by a news article it has never seen before. Which outcome would best demonstrate that the model has successfully generalized from its training?
Evaluating Model Generalization
Analyzing a Model's Performance Failure