1Cademy - Evaluating Objective Function Designs

Learn Before

Preference for Divergence-Based Objective Functions

Case Study

Evaluating Objective Function Designs

Two researchers are developing an objective function to train a language model. Researcher A proposes minimizing an objective that represents the difference between the log-probability of the model's distribution and the log-probability of the target data distribution. Researcher B proposes minimizing an objective that represents the difference between the log-probability of the model's distribution and an arbitrary scoring function that is not based on a probability distribution. Analyze the two proposed objective functions. Which one is conceptually preferable and why? Explain the key advantage of the preferred structure.
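One way to explore the comparison numerically is the minimal sketch below. It makes several assumptions the case study does not state: the difference of log-probabilities is averaged under the model distribution, the distributions are small discrete ones, and the values in `target`, `model`, and `scores` are hypothetical numbers chosen only for illustration. Under those assumptions, Researcher A's objective is the KL divergence from the model to the target, which is never negative and equals zero exactly when the two distributions match. Researcher B's objective has no such fixed reference point: its minimum depends on the scale of the scoring function.

```python
import math


def kl_divergence(q, p):
    """Researcher A (assumed form): E_q[log q(x) - log p(x)], the KL divergence KL(q || p)."""
    return sum(qi * (math.log(qi) - math.log(pi)) for qi, pi in zip(q, p) if qi > 0)


def score_objective(q, scores):
    """Researcher B (assumed form): E_q[log q(x) - s(x)] for an unnormalized score s."""
    return sum(qi * (math.log(qi) - si) for qi, si in zip(q, scores) if qi > 0)


# Hypothetical values, not taken from the case study.
target = [0.7, 0.2, 0.1]   # target data distribution p
model = [0.5, 0.3, 0.2]    # current model distribution q
scores = [3.0, 1.0, 0.5]   # arbitrary scores that do not form a distribution

# Objective A: positive when the model differs from the target, zero when they match.
kl_mismatch = kl_divergence(model, target)
kl_match = kl_divergence(target, target)

# Objective B: the minimizer is softmax(scores), and the minimum value is
# -log(sum(exp(scores))), which shifts whenever the scores are rescaled or offset.
log_z = math.log(sum(math.exp(s) for s in scores))
best_q = [math.exp(s - log_z) for s in scores]
score_min = score_objective(best_q, scores)

print("KL, model vs target:", kl_mismatch)
print("KL, target vs target:", kl_match)
print("Minimum of score-based objective:", score_min)
```

In this sketch the divergence-based value can be read directly as a distance-like measure of how far the model is from the target, while the score-based value cannot be interpreted without also knowing the normalizer of the scoring function.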

Updated 2025-10-06
