Diagnosing LLM Performance Issues
Based on the description of the training data, what is the most likely underlying reason for the AI assistant's poor performance on its intended tasks? Explain your reasoning.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A research team fine-tunes a large language model on an extensive dataset containing hundreds of thousands of examples. The dataset is exclusively composed of well-structured problems, such as summarizing scientific articles, translating legal texts, and answering questions based on encyclopedia entries. The team then deploys this model as a general-purpose chatbot for public use. Which of the following scenarios most accurately predicts the chatbot's likely performance?
Diagnosing LLM Performance Issues
Evaluating a Dataset for a Real-World AI Assistant