Evaluating a Chatbot's Training Limitations
A company develops a customer support chatbot. They train a general-purpose language model on a large, high-quality dataset consisting of thousands of ideal customer questions and perfectly crafted, helpful, and safe responses. After this training phase, the company deploys the chatbot. While it performs well on common queries, it is soon discovered that when users ask complex, multi-part questions or express frustration in unusual ways, the chatbot occasionally provides factually incorrect information or responds with a subtly unhelpful, dismissive tone. Based on this scenario, evaluate the company's training approach and explain the most likely reason for the chatbot's undesirable behaviors.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Learning from Human Feedback
A development team trains a large language model on a vast dataset of high-quality, curated instruction-and-response pairs to create a helpful chatbot. After this training, they observe that while the model answers most questions correctly, it occasionally generates responses that are subtly biased or confidently presents outdated, incorrect information when faced with novel or ambiguous user queries. Which of the following statements best analyzes the fundamental limitation demonstrated by the model's behavior?
Evaluating a Chatbot's Training Limitations
Analyzing Model Behavior After Instruction-Based Training