Short Answer

Analyzing the Components of a Model Mimicry Loss Function

In the context of training a smaller model to mimic a larger one, the training objective is to minimize the loss function, formally expressed as $\text{Loss}(\text{Pr}^t(\cdot|\cdot), \text{Pr}_{\theta}^s(\cdot|\cdot), \mathbf{x})$. Identify which component of this function is directly adjusted during the training process and explain why the other components are considered fixed.
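As a way to reason about the roles of the three arguments, the following is a minimal sketch, not a definitive implementation. It assumes a single fixed input x, a teacher distribution given as a plain list of probabilities, a student parameterized only by a vector of logits, and cross-entropy as the loss; none of these choices come from the question itself, and the names `softmax`, `mimicry_loss`, and `train_student` are hypothetical.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mimicry_loss(teacher_probs, student_logits):
    # Assumed loss: cross-entropy of the student's distribution
    # against the teacher's distribution, for one input x.
    student_probs = softmax(student_logits)
    return -sum(p * math.log(q) for p, q in zip(teacher_probs, student_probs))

def train_student(teacher_probs, student_logits, lr=0.5, steps=200):
    # Only theta (the student's parameters) is updated.
    # teacher_probs is read but never modified.
    theta = list(student_logits)
    for _ in range(steps):
        student_probs = softmax(theta)
        # Gradient of the cross-entropy w.r.t. the logits is q - p
        # when the teacher probabilities sum to 1.
        grad = [q - p for p, q in zip(teacher_probs, student_probs)]
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta

# Hypothetical values standing in for Pr^t(.|x) and the initial theta.
teacher_probs = [0.7, 0.2, 0.1]
initial_theta = [0.0, 0.0, 0.0]

loss_before = mimicry_loss(teacher_probs, initial_theta)
trained_theta = train_student(teacher_probs, initial_theta)
loss_after = mimicry_loss(teacher_probs, trained_theta)
```

In this sketch the update loop touches `theta` alone, while the teacher's probabilities and the input they were computed on enter the loss only as constants.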


Updated 2025-10-08


Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.3 Prompting - Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science