Multiple Choice

A team is training a compact 'student' model to emulate a powerful 'teacher' model. The training objective is to minimize a loss function that measures the divergence between the probability distributions of the student model's outputs and the teacher model's outputs for a given set of inputs. What is the primary goal of this optimization process?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science