Case Study

Fine-Tuning Performance Analysis

A data science team starts with a large language model that excels at summarizing general news articles. They then adapt this model by training it exclusively on a large dataset of legal contracts to create a specialized legal summarizer. The new model achieves state-of-the-art performance on summarizing legal documents. However, when they later evaluate this specialized model on its original task of summarizing news articles, they find its performance has significantly degraded; the summaries are often convoluted and miss the main points. Analyze the most probable cause for this performance degradation on the original task.

0

1

Updated 2025-10-01

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science