1Cademy

Learn Before

Objective Function for Fine-Tuning a Strong LLM with Weak Supervision

Multiple Choice

A research team is adapting a large, powerful language model (the 'strong model') for a specialized task. They lack a large set of human-verified labels, but they have a smaller, less accurate model (the 'weak model') that can generate plausible, albeit imperfect, labels. The team's strategy is to use the weak model to label a large unlabeled dataset and then fine-tune the strong model to mimic the weak model's labeling behavior on this dataset. Which of the following mathematical objectives best represents the goal of finding the optimal strong model parameters, $\tilde{\theta}$, that maximize the strong model's ability to predict the labels, $\hat{\mathbf{y}}$, generated by the weak model for a given set of inputs, $\mathbf{x}$?
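
As a point of reference, the goal described in the question is commonly written as a maximum-likelihood objective over the weak model's labels, for example $\tilde{\theta} = \arg\max_{\theta} \sum_{(\mathbf{x}, \hat{\mathbf{y}})} \log \Pr_{\theta}(\hat{\mathbf{y}} \mid \mathbf{x})$. The summation over a dataset and the symbol $\Pr_{\theta}$ are notational assumptions here, not taken from the question. The sketch below illustrates that idea on a toy problem: the "strong model" is a hypothetical table of per-input logits, and the arg max is taken over just two candidate parameter settings rather than by gradient-based fine-tuning, which is what would be used in practice.

```python
import math

def softmax(logits):
    """Turn a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_likelihood(probs, weak_labels):
    """Sum of log Pr_theta(y_hat | x) over all inputs."""
    return sum(math.log(p[y]) for p, y in zip(probs, weak_labels))

# Hypothetical toy data: labels produced by the weak model for three inputs.
weak_labels = [0, 1, 1]

# Two hypothetical strong-model parameter settings (one logit row per input).
theta_a = [[2.0, 0.0], [0.0, 2.0], [0.0, 2.0]]  # agrees with every weak label
theta_b = [[0.0, 2.0], [2.0, 0.0], [0.0, 2.0]]  # disagrees on the first two inputs

def objective(theta):
    """Log-likelihood of the weak labels under the strong model."""
    return log_likelihood([softmax(row) for row in theta], weak_labels)

# The arg max over candidates: the setting that best predicts the weak labels.
best = max([theta_a, theta_b], key=objective)
```

Here `best` is the candidate that assigns the highest total log-probability to the weak model's labels. Note that the objective rewards agreement with the weak labels, not with any ground truth, which is exactly the weak-supervision setup the question describes.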


Updated 2025-09-28
