Multiple Choice

An engineer is training a small 'student' model by learning from a larger 'teacher' model. The training objective is to find the student parameters (θ) that maximize a combined score, formulated as: score=(Term AλTerm B)\text{score} = \sum (\text{Term A} - \lambda \cdot \text{Term B}) where 'Term A' measures how well the student predicts the correct, ground-truth answers, and 'Term B' measures how closely the student's outputs match the teacher's outputs. After training, the engineer notices the student model is replicating systematic errors present in the teacher model, leading to poor performance on a validation set. Which adjustment to the hyperparameter λ is the most appropriate first step to address this issue?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related