A machine learning engineer is fine-tuning a pre-trained language model and observes several undesirable behaviors. Match each observed behavior with the most appropriate mitigation strategy.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Diagnosing and Correcting a Fine-Tuning Process
Evaluating a Flawed Fine-Tuning Strategy
A development team is fine-tuning a large, pre-trained language model on a specialized dataset of medical research papers. They observe that while the model's performance on medical queries is excellent, it has started to perform poorly on simple, general-knowledge questions it could previously answer correctly. Which of the following adjustments to the fine-tuning process is the most direct and effective strategy to address this degradation in general capabilities?
A machine learning engineer is fine-tuning a pre-trained language model and observes several undesirable behaviors. Match each observed behavior with the most appropriate mitigation strategy.