1Cademy - Fine-Tuning Performance Degradation

Learn Before

Risk of Overfitting and Catastrophic Forgetting in SFT

Case Study

Fine-Tuning Performance Degradation

Based on the scenario described, analyze the two primary, interconnected phenomena that are most likely responsible for the observed change in the model's behavior. Explain how each phenomenon contributes to the final outcome.

Updated 2025-09-28

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Strategies to Mitigate Overfitting and Catastrophic Forgetting in SFT
Fine-Tuning Performance Degradation
A team fine-tunes a large, pre-trained language model, known for its strong general knowledge, on a highly specialized dataset of legal contracts. They train the model for a very large number of iterations. After fine-tuning, the model demonstrates exceptional performance in generating and interpreting legal text but now provides nonsensical or incorrect answers to simple, general knowledge questions it could easily answer before. What is the most likely explanation for this change in the model'
The Interplay of Overfitting and Knowledge Loss in Model Tuning
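
The knowledge-loss side of the scenario above can be illustrated with a deliberately tiny sketch. It is not a language model: it assumes a single shared weight, a hypothetical "general" task (y = 2x), and a hypothetical "specialized" task (y = 5x) that stands in for the legal dataset. All names, data values, learning rates, and step counts are illustrative assumptions. The sketch only shows how prolonged training on narrow data pulls shared parameters away from what the original task needed; it does not model overfitting on held-out specialized data.

```python
# Toy illustration of forgetting under prolonged fine-tuning.
# Assumption: one shared weight w serves both tasks, so fitting the
# specialized task necessarily moves w away from the general-task optimum.

def mse(w, data):
    """Mean squared error of the one-parameter model y_hat = w * x."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def train(w, data, steps, lr=0.05):
    """Plain full-batch gradient descent on the MSE loss."""
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w

# Hypothetical stand-ins for the two data distributions.
general = [(x, 2.0 * x) for x in (-2.0, -1.0, 1.0, 2.0)]  # "general knowledge"
legal = [(x, 5.0 * x) for x in (-1.0, 1.0)]               # "legal contracts"

w_pre = train(0.0, general, steps=200)    # "pre-training"
w_short = train(w_pre, legal, steps=3)    # brief fine-tuning
w_long = train(w_pre, legal, steps=500)   # very long fine-tuning

for name, w in [("pre-trained", w_pre), ("short SFT", w_short), ("long SFT", w_long)]:
    print(f"{name:12s} general loss={mse(w, general):.3f}  legal loss={mse(w, legal):.3f}")
```

In this toy setting, the longer the specialized training runs, the lower the specialized loss and the higher the general loss become. That is why limiting the number of fine-tuning steps, or mixing some general data into the fine-tuning set, is commonly suggested as a mitigation.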
