Case Study

Diagnosing LLM Performance Issues

A technology company has successfully developed a large language model after an extensive initial training phase on a massive dataset of text and code from the internet. During internal testing, the team observes two primary issues:

  1. When given a direct command like 'Summarize the following paragraph about photosynthesis,' the model often continues the paragraph with more facts about photosynthesis rather than providing a summary.
  2. When asked open-ended questions like 'What are some creative and safe activities for a children's party?,' the model's suggestions are sometimes impractical or ignore the 'safe' constraint.

Based on these observations, which major development stage should the company focus on next to address these specific shortcomings? Justify your answer by explaining how the components of this stage would resolve the two observed issues.

0

1

Updated 2025-10-10

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science