Case Study

Evaluating Model Performance via Loss

Based on the provided case study, which model (A or B) would have a lower value for the loss function L(pθ,pgold)\mathcal{L}(\mathbf{p}^{\theta}, \mathbf{p}^{\text{gold}}), and why? Your explanation should connect the properties of the probability distributions to the purpose of the loss function in model training.

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science