Short Answer

Analyzing a Formalism for Token Selection

A researcher attempts to formalize the process of selecting the K most probable next tokens with the following expression: S = argMax_{y_i ∈ V} Pr(y_i|x, y_{<i}), where S is the set of selected tokens and V is the vocabulary. Identify the primary error in this formulation for a scenario where K > 1, and explain why it is incorrect.
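As a point of comparison for the question, here is a minimal sketch contrasting an argmax operator with a top-K selection on a toy distribution. The vocabulary, the probability values, and the helper names `arg_max` and `arg_top_k` are all hypothetical and chosen only for illustration; they do not come from the course material.

```python
import heapq

# Hypothetical toy next-token distribution (illustrative values only).
probs = {"the": 0.40, "a": 0.25, "cat": 0.20, "dog": 0.10, "sat": 0.05}


def arg_max(dist):
    """Return the single token with the highest probability."""
    return max(dist, key=dist.get)


def arg_top_k(dist, k):
    """Return the k tokens with the highest probabilities, best first."""
    return heapq.nlargest(k, dist, key=dist.get)


best = arg_max(probs)       # one token, regardless of K
top3 = arg_top_k(probs, 3)  # a list of three tokens

print(best)
print(top3)
```

Note that `arg_max` takes no K at all, so its result cannot grow with K, whereas `arg_top_k` returns a collection whose size is set by `k`.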

Updated 2025-10-04

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science