Transferring a Specialized Skill
Based on the provided scenario, outline the specific, step-by-step process for fine-tuning the large model. Your explanation should detail how the training signal is generated and what it is used for in each step of the process.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Combined Loss Objective in Weak-to-Strong Training
A team is fine-tuning a large, powerful model to perform a specific task. Instead of using a dataset with pre-defined correct answers, they use a smaller, weaker model as a live supervisor. For each input, the large model generates an output, and the weaker model also generates an output. A loss value is then calculated based on the difference between these two outputs. What is the direct and immediate purpose of this calculated loss value within the training loop?
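
The weak-supervisor training loop described in the related question above can be sketched in a few lines. This is only a toy illustration, not the course's reference implementation: both "models" are assumed to be one-dimensional linear functions, the weak model is assumed to be frozen, and the names `weak_supervisor` and `train_strong_model` are hypothetical. It shows that the loss is computed from the disagreement between the two outputs and is used only to update the large model's parameters.

```python
# Toy sketch (assumption): both "models" are scalar linear functions.
def weak_supervisor(x):
    """Stand-in for the small model; assumed frozen, its output is the target."""
    return 2.0 * x + 1.0


def train_strong_model(inputs, lr=0.05, epochs=200):
    """Fit the large model to the weak model's outputs with plain SGD."""
    w, b = 0.0, 0.0  # parameters of the large model (the only ones updated)
    loss = 0.0
    for _ in range(epochs):
        for x in inputs:
            strong_out = w * x + b         # large model generates an output
            weak_out = weak_supervisor(x)  # weak model generates an output
            diff = strong_out - weak_out
            loss = diff ** 2               # loss measures the disagreement
            # Gradients of the loss with respect to the large model's parameters
            grad_w = 2.0 * diff * x
            grad_b = 2.0 * diff
            # The loss is used here, and only here: to adjust the large model
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b, loss


w, b, final_loss = train_strong_model([-1.0, 0.0, 1.0, 2.0])
```

In a real setting the squared difference would typically be replaced by a loss over token distributions and the manual gradients by automatic differentiation, but the role of the loss value in the loop is the same.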