A development team is fine-tuning a very large, powerful language model. Instead of using human-labeled data, they use a much smaller, less capable model to generate labels for a vast dataset. The training objective is to make the large model's predictions match the small model's labels as closely as possible, viewing the process as a transfer of 'knowledge' from the small model to the large one. Based on this methodology, what is the most significant potential pitfall?
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Risk of Overfitting in Weak-to-Strong Fine-Tuning
Example of Successful Weak-to-Strong Generalization: GPT-4 with GPT-2 Supervision
Analyzing the Weak-to-Strong Objective Function
Framing the process of fine-tuning a powerful model with labels from a weaker model as a form of knowledge distillation ensures that the powerful model will automatically learn to generalize beyond the weaker model's capabilities and correct its mistakes.
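The objective described in the question — training the strong model so its predictions match the weak model's labels as closely as possible — is, in effect, a cross-entropy loss against weak-generated labels. The sketch below is a minimal NumPy illustration (the function names, toy logits, and class labels are illustrative assumptions, not from the source); it shows why minimizing this loss can pull a strong model *away* from a correct prediction when the weak supervisor's label is wrong:

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def weak_to_strong_loss(strong_logits, weak_labels):
    """Cross-entropy of the strong model's predictions against labels
    produced by the weak model. Minimizing this drives the strong model
    to imitate the weak supervisor -- including the supervisor's
    mistakes, which is the overfitting risk the question targets."""
    probs = softmax(strong_logits)
    n = strong_logits.shape[0]
    return -np.mean(np.log(probs[np.arange(n), weak_labels] + 1e-12))

# Toy example: the strong model already puts high probability on the
# (hypothetically) correct class 0, but the weak supervisor says class 1.
strong_logits = np.array([[4.0, 0.0, 0.0]])
loss_on_correct_label = weak_to_strong_loss(strong_logits, np.array([0]))
loss_on_weak_label    = weak_to_strong_loss(strong_logits, np.array([1]))

# The loss against the weak (wrong) label is larger, so gradient descent
# on this objective would push the strong model to abandon its own
# correct prediction in favor of the weak model's error.
```

Nothing in this objective rewards the strong model for disagreeing with the weak supervisor, which is why generalization beyond the weak model's capability is not automatic under pure imitation.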