
Weak-to-Strong Generalization via Fine-Tuning on Weak Model Data

One approach to weak-to-strong generalization involves a two-stage process. First, a dataset is curated using a small, weak model, either by having the weak model generate labels for a set of inputs or by using it to select high-quality examples from a larger, pre-existing dataset. In the second stage, a large, strong model is fine-tuned on this curated dataset. The training objective is to minimize a loss function, such as a Knowledge Distillation (KD) loss, that measures the discrepancy between the strong model's outputs and the weak model's labels.
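A minimal sketch of this two-stage pipeline is shown below, assuming PyTorch and a KD-style loss on soft labels. The toy model class, sizes, and random data are illustrative placeholders rather than part of the original description.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
VOCAB, HIDDEN_WEAK, HIDDEN_STRONG, SEQ_LEN = 100, 32, 128, 16

class TinyLM(nn.Module):
    """A toy token predictor standing in for a language model."""
    def __init__(self, hidden):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, hidden)
        self.proj = nn.Linear(hidden, VOCAB)

    def forward(self, tokens):                # tokens: (batch, seq)
        return self.proj(self.embed(tokens))  # logits: (batch, seq, vocab)

weak = TinyLM(HIDDEN_WEAK)      # small, weak supervisor
strong = TinyLM(HIDDEN_STRONG)  # large, strong student

# Stage 1: the weak model labels a batch of unlabeled inputs (soft labels here).
inputs = torch.randint(0, VOCAB, (8, SEQ_LEN))
with torch.no_grad():
    weak_probs = F.softmax(weak(inputs), dim=-1)

# Stage 2: fine-tune the strong model to match the weak labels via a KD loss.
optimizer = torch.optim.AdamW(strong.parameters(), lr=1e-3)
for step in range(100):
    strong_log_probs = F.log_softmax(strong(inputs), dim=-1)
    # KL divergence between the strong model's distribution and the weak labels.
    loss = F.kl_div(strong_log_probs, weak_probs, reduction="batchmean")
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

If the weak model instead provides hard labels (e.g., its argmax predictions) or filters an existing dataset, only Stage 1 changes; the strong model is still fine-tuned on the resulting data, typically with a cross-entropy loss in place of the KL term.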
