Diagnosing Inconsistent Fine-Tuning Performance
Based on the case study below, provide the most likely explanation for the model's inconsistent performance. Your explanation should focus on the relationship between the model's initial, extensive training phase and its subsequent specialized tuning process.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Primary Source of Out-of-Distribution Generalization: Pre-training vs. Fine-tuning
Diagnosing Inconsistent Fine-Tuning Performance
A development team is fine-tuning a large, pre-trained language model to act as a specialized legal assistant. They notice that the model quickly masters tasks related to contract law after seeing only a few examples, but struggles to generate accurate summaries of intellectual property case law, even with a large number of fine-tuning examples. What is the most likely underlying reason for this discrepancy?
The 'Unknown Unknowns' of Fine-Tuning Strategy