1Cademy - Definition of Students Probability Distribution (P

Learn Before

Teacher-Student Model Architecture in Knowledge Distillation

Formula

Definition of Student's Probability Distribution (P_theta^s)

The student model's output probability distribution, parameterized by $\theta$ , is denoted by $P_{\theta}^{s}$ . This distribution is formally defined as the conditional probability of an output given a context $\mathbf{c}'$ and a latent variable $\mathbf{z}$ . The mathematical expression is $P_{\theta}^{s} = \text{Pr}_{\theta}^{s}(\cdot|\mathbf{c}', \mathbf{z})$ .