Short Answer

Analyzing LLM Performance

An AI development team is evaluating their new instruction-tuned language model. They observe two distinct behaviors:

  1. The model excels at summarizing scientific articles, even those from fields it wasn't explicitly trained on, as long as the instruction is "summarize this text."
  2. The model struggles when given a mix of instructions it has seen before, such as "translate this sentence," "write a poem," and "explain this concept," often confusing the required output formats.

Based on this scenario, identify which type of generalization the model demonstrates well and which type it lacks. Briefly justify your answer for each.

Updated 2025-10-10

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science