Diagnosing and Correcting a Fine-Tuning Process
Based on the scenario provided, identify the two primary issues the fine-tuned model is exhibiting. Then, propose one specific strategy the team could implement in their next training run to address these issues and explain why this strategy would be effective.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Diagnosing and Correcting a Fine-Tuning Process
Evaluating a Flawed Fine-Tuning Strategy
A development team is fine-tuning a large, pre-trained language model on a specialized dataset of medical research papers. They observe that while the model's performance on medical queries is excellent, it has started to perform poorly on simple, general-knowledge questions it could previously answer correctly. Which of the following adjustments to the fine-tuning process is the most direct and effective strategy to address this degradation in general capabilities?
A machine learning engineer is fine-tuning a pre-trained language model and observes several undesirable behaviors. Match each observed behavior with the most appropriate mitigation strategy.