Objective Function for Fine-Tuning a Strong LLM with Weak Supervision
The process of fine-tuning a strong large language model on synthetic data generated by a weak model can be mathematically formalized. Given a collection of inputs $\{x_1, \dots, x_N\}$, where each input $x_i$ includes an instruction and any necessary user input, a weak LLM, denoted $\pi_{\text{weak}}$, generates a prediction $\hat{y}_i = \pi_{\text{weak}}(x_i)$. The strong LLM, denoted $\pi_\theta$ with parameters $\theta$, is then trained on these predictions. The objective is to find the optimal parameters $\hat{\theta}$ that maximize the log-probability of the weak model's generated predictions: $\hat{\theta} = \arg\max_{\theta} \sum_{i=1}^{N} \log \Pr_{\theta}(\hat{y}_i \mid x_i)$.
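As a minimal, illustrative sketch of this objective (the model, names, and toy data here are my own assumptions, not from the text): a one-parameter logistic "strong model" is fit by gradient ascent on the summed log-probability of labels produced by a weak heuristic labeler rather than by humans.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def weak_model(x):
    """Stand-in weak supervisor: a crude threshold labeler (imperfect but cheap)."""
    return 1 if x > 0 else 0

def log_likelihood(theta, inputs):
    """The objective: sum_i log P_theta(y_hat_i | x_i), with y_hat_i from the weak model."""
    total = 0.0
    for x in inputs:
        p = sigmoid(theta * x)          # strong model's P(y=1 | x)
        y = weak_model(x)               # weak model's prediction as the target
        total += math.log(p if y == 1 else 1.0 - p)
    return total

def fit(inputs, lr=0.1, steps=200):
    """Gradient ascent on the log-likelihood of the weak model's labels."""
    theta = 0.0
    for _ in range(steps):
        # d/dtheta of the logistic log-likelihood: sum_i (y_i - p_i) * x_i
        grad = sum((weak_model(x) - sigmoid(theta * x)) * x for x in inputs)
        theta += lr * grad
    return theta

inputs = [-2.0, -1.0, 1.0, 2.0]
theta_hat = fit(inputs)
```

After fitting, `log_likelihood(theta_hat, inputs)` exceeds `log_likelihood(0.0, inputs)`: the strong model has learned to reproduce the weak model's labeling behavior, which is exactly what the objective asks for (including any errors the weak model makes).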

Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Objective Function for Fine-Tuning a Strong LLM with Weak Supervision
A research team is developing a powerful new language model for summarizing scientific papers. Lacking a large, human-curated dataset of summaries, they use an older, less accurate model to generate summaries for 100,000 papers. They then fine-tune their powerful new model on this machine-generated dataset, with the goal of teaching it to produce summaries that match the ones from the older model. What is the most significant inherent risk in this training strategy?
Training Strategy for a Legal AI
Visual Diagram of Weak-to-Strong Generalization via Data Selection
A team is implementing a strategy where a powerful language model learns from a less capable one. Arrange the following steps into the correct chronological order to describe this process.
Your company is rolling out an instruction-tuned L...
You lead an LLM enablement team building an instru...
You’re leading an LLM platform team building an in...
Your company is building an internal IT helpdesk a...
Deciding Whether (and How) to Use Weak-Model Synthetic Data for Instruction Fine-Tuning
Diagnosing and Fixing a Synthetic Instruction-Tuning Data Flywheel That Degrades Model Behavior
Designing a Synthetic Instruction Fine-Tuning Pipeline Under Budget and Quality Constraints
Stabilizing an Instruction-Tuned Support Assistant When Synthetic Data Conflicts with Human Policy
Selecting and Filtering Self-Generated Instruction Data When Bootstrapping a Strong Model from a Weak Supervisor
Choosing a Weak-Model + Self-Instruct Data Strategy for Instruction Fine-Tuning Without Regressions
Learn After
Weak-to-Strong Fine-Tuning as a Knowledge Distillation Problem
A research team is adapting a large, powerful language model (the 'strong model') for a specialized task. They lack a large set of human-verified labels, but they have a smaller, less accurate model (the 'weak model') that can generate plausible, albeit imperfect, labels. The team's strategy is to use the weak model to label a large unlabeled dataset and then fine-tune the strong model to mimic the weak model's labeling behavior on this dataset. Which of the following mathematical objectives best represents the goal of finding the optimal strong model parameters, $\hat{\theta}$, that maximize the strong model's ability to predict the labels, $\hat{y}_i$, generated by the weak model for a given set of inputs, $x_i$?
Analyzing Overfitting in Weak-to-Strong Fine-Tuning
Deconstructing the Weak-to-Strong Fine-Tuning Objective