Short Answer

Critique of an Objective Function Formulation

A researcher proposes the following objective function to train a language model, where p_model(y|x) is the model's probability distribution and f(x, y) is an arbitrary scoring function that is not a probability distribution: Objective = E [log p_model(y|x) - f(x, y)]. From a conceptual standpoint, what is the primary weakness of formulating the objective in this way?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science