Learn Before
Refining a Specialized Code Generation Model
An AI development team has fine-tuned a large language model to be an expert assistant for the Python programming language. However, they observe that the model frequently provides code snippets in other languages like JavaScript or C++ when prompted with ambiguous queries. A senior engineer suggests a further adaptation step using a new dataset that includes not only more Python examples but also examples of JavaScript queries explicitly marked as 'out of scope'. Explain the reasoning behind why introducing this more varied data set could help the model become more specialized in Python.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A development team has trained a language model to function as a specialized chatbot for booking restaurant reservations. After the initial training, they find that the model often answers questions about recipes or restaurant reviews, deviating from its core task. Which of the following strategies is most likely to effectively steer the model back to its intended specialized function?
Refining a Specialized Legal LLM
Refining a Specialized Code Generation Model