Multiple Choice

A team is training a soft prompt (σ) to help a language model generate a specific, high-quality target sentence (ŷ) when given an input (z). They are considering two different optimization objectives:

  • Objective 1: Adjust the soft prompt σ to maximize the probability of the model generating the exact target sentence ŷ.
  • Objective 2: Adjust the soft prompt σ so that the model's entire probability distribution over the next possible word matches the distribution it would have had if it were conditioned on the full, original context instead of the prompt.

Which statement best evaluates the fundamental difference in what these two objectives are trying to achieve?

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related