Critiquing a Fine-Tuning Strategy
A developer fine-tunes a large, pre-trained language model on a small, specialized dataset. After training, they find the model achieves near-perfect accuracy on the specialized task but has significantly lost its ability to answer general questions it previously handled well. Based on the principles of parameter adjustment during fine-tuning, critique the developer's likely training approach and explain why this performance trade-off occurred.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A machine learning engineer is adapting a large, pre-trained language model for a highly specialized task using a small dataset. They choose an aggressive training strategy that results in substantial changes to the model's parameters from their initial, pre-trained values. Which of the following outcomes represents the most significant risk of this approach?
Diagnosing Fine-Tuning Performance Degradation
Critiquing a Fine-Tuning Strategy