Short Answer

Defining the Candidate Set in a Greedy Decoding Strategy

A language model is generating a sequence using a decoding strategy where, at each step, it only pursues the single most likely option. After step i-1, the single best sequence is y_1...y_{i-1}. At step i, the model computes the probabilities for the next token, finding that p(token_A | y_1...y_{i-1}) = 0.6, p(token_B | y_1...y_{i-1}) = 0.3, and p(token_C | y_1...y_{i-1}) = 0.1. Formally define the candidate set of sequences, Y_i, at the end of step i and explain the reasoning behind its composition.

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science