1Cademy - Diagnosing LLM Performance Issues

Learn Before

Characteristics and Limitations of Early Instruction Fine-Tuning Datasets

Case Study

Diagnosing LLM Performance Issues

Based on the description of the training data, what is the most likely underlying reason for the AI assistant's poor performance on its intended tasks? Explain your reasoning.

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

A research team fine-tunes a large language model on an extensive dataset containing hundreds of thousands of examples. The dataset is exclusively composed of well-structured problems, such as summarizing scientific articles, translating legal texts, and answering questions based on encyclopedia entries. The team then deploys this model as a general-purpose chatbot for public use. Which of the following scenarios most accurately predicts the chatbot's likely performance?
Diagnosing LLM Performance Issues
Evaluating a Dataset for a Real-World AI Assistant

Learn Before

Related