Analyzing Model Behavior After Instruction-Based Training
A team of developers has trained a large language model using a comprehensive dataset of high-quality instructions and their corresponding ideal responses. Despite this extensive training, they find the model sometimes generates factually incorrect or subtly biased answers. In two to three sentences, explain the primary reason why this training method alone is insufficient to prevent such undesirable outputs.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Learning from Human Feedback
A development team trains a large language model on a vast dataset of high-quality, curated instruction-and-response pairs to create a helpful chatbot. After this training, they observe that while the model answers most questions correctly, it occasionally generates responses that are subtly biased or confidently presents outdated, incorrect information when faced with novel or ambiguous user queries. Which of the following statements best analyzes the fundamental limitation demonstrated by the model's behavior?
Evaluating a Chatbot's Training Limitations
Analyzing Model Behavior After Instruction-Based Training