Multiple Choice

A research team is developing a system to answer questions based on a large document. Instead of feeding the entire document into a language model for every question, they want to learn a compressed, continuous representation of the document (a 'soft prompt', σ). Their process is as follows:

  1. First, for a given question (z), they run the model with the full document to get a high-quality, 'gold standard' answer (ŷ).
  2. Next, they try to find the optimal soft prompt (σ) that, when paired with the original question (z), causes the model to produce that same 'gold standard' answer (ŷ).

They define the 'optimal' soft prompt as the one that maximizes the probability of generating the 'gold standard' answer. Based on this optimization strategy, which statement best describes the primary goal of learning the soft prompt?
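
For reference, the two steps can be sketched in symbols. This is a hedged restatement, not the team's exact formulation: the symbol c for the full document and the notation Pr(· | ·) for the model's output distribution are assumptions introduced here, and writing step 1 as an argmax assumes the 'gold standard' answer is the model's most probable output given the document.

```latex
% Step 1: gold-standard answer, produced with the full document c (assumed symbol)
\hat{y} = \arg\max_{y} \Pr(y \mid c, z)

% Step 2: soft prompt that makes the model most likely to reproduce \hat{y}
% when it sees only \sigma and the question z
\hat{\sigma} = \arg\max_{\sigma} \Pr(\hat{y} \mid \sigma, z)
             = \arg\max_{\sigma} \log \Pr(\hat{y} \mid \sigma, z)
```

The second equality holds because the logarithm is monotonic, so maximizing the log-probability selects the same σ. Note that the document c appears only in step 1; in step 2 the model conditions on σ and z alone.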

Updated 2025-09-28

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science
