1Cademy - Critique of an Objective Function Formulation

Learn Before

Preference for Divergence-Based Objective Functions

Short Answer

Critique of an Objective Function Formulation

A researcher proposes the following objective function to train a language model, where p_model(y|x) is the model's probability distribution and f(x, y) is an arbitrary scoring function that is not a probability distribution: Objective = E [log p_model(y|x) - f(x, y)]. From a conceptual standpoint, what is the primary weakness of formulating the objective in this way?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related