Consequences of Specialized Fine-Tuning
A development team takes a large language model, which is highly proficient at general conversation, and fine-tunes it exclusively on a dataset of their company's technical product manuals to create a specialized support bot. Describe a likely negative impact on the model's original conversational abilities and explain the core reason for this phenomenon.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Catastrophic Forgetting in Fine-Tuning
Fine-Tuning Performance Analysis
A research team starts with a large language model pre-trained on a massive, diverse text corpus, which shows strong performance across a general language understanding benchmark. They then fine-tune this model on a small, highly specialized dataset for classifying medical research abstracts. After fine-tuning, the model achieves 99% accuracy on the medical abstract test set, but when re-evaluated on the original general language benchmark, its performance has dropped by 20%. What is the most likely explanation for this outcome?
Consequences of Specialized Fine-Tuning