Case Study

Evaluating Objective Function Designs

Two researchers are developing an objective function to train a language model. Researcher A proposes minimizing an objective that represents the difference between the log-probability of the model's distribution and the log-probability of the target data distribution. Researcher B proposes minimizing an objective that represents the difference between the log-probability of the model's distribution and an arbitrary scoring function that is not based on a probability distribution. Analyze the two proposed objective functions. Which one is conceptually preferable and why? Explain the key advantage of the preferred structure.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science