Formula

Definition of Student's Probability Distribution ($P_\theta^s$)

In the context of knowledge distillation, $P_\theta^s$ denotes the probability distribution over the student model's outputs. This distribution is conditioned on a given context $\mathbf{c}'$ and a latent variable $\mathbf{z}$, and is parameterized by the student model's weights $\theta$. Formally, $P_\theta^s = \text{Pr}_\theta^s(\cdot \mid \mathbf{c}', \mathbf{z})$, where the dot stands for the output being predicted.
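To make the notation concrete, the following is a minimal sketch of a distribution with the same shape as $P_\theta^s$: a function of the context and the latent variable, parameterized by weights. Everything in it is an assumption made for illustration, not something the definition specifies: the toy linear model, the three-token vocabulary, the representation of `c_prime` and `z` as feature lists, and the names `softmax` and `student_distribution`.

```python
import math


def softmax(logits):
    """Turn a list of real-valued scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def student_distribution(theta, c_prime, z):
    """Hypothetical stand-in for Pr_theta^s(. | c', z).

    theta   : one weight row per vocabulary item (the student's parameters)
    c_prime : feature list for the context c'
    z       : feature list for the latent variable z
    Returns a probability for each vocabulary item.
    """
    features = c_prime + z  # condition on both c' and z by concatenating them
    logits = [sum(w * f for w, f in zip(row, features)) for row in theta]
    return softmax(logits)


# Toy values, chosen arbitrarily for illustration.
theta = [[0.5, -0.2, 0.1], [0.0, 0.3, -0.4], [-0.1, 0.2, 0.6]]
c_prime = [1.0, 0.5]
z = [2.0]

p = student_distribution(theta, c_prime, z)
print(p)  # one probability per vocabulary item
```

Changing `theta`, `c_prime`, or `z` changes the returned probabilities, which is what the subscript $\theta$ and the conditioning bar in the formula express.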

Updated 2026-05-02

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences