Essay

Evaluating the 'Arg Max' Prediction Strategy

A common strategy for generating text with a probabilistic model is to always choose the single most likely output, a process formally described as $\hat{\mathbf{y}} = \underset{\mathbf{y}}{\text{arg max}} \ \text{Pr}(\mathbf{y}|\mathbf{x})$. Evaluate this strategy. Discuss one significant advantage and one significant disadvantage of strictly adhering to this rule for tasks like creative writing or chatbot conversations.
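
To make the rule concrete, the following is a minimal sketch, not a real language model. It assumes a hypothetical toy distribution `toy_distribution` standing in for Pr(y|x) over three complete candidate outputs, and compares the arg max rule with drawing an output at random in proportion to its probability. The function names `arg_max_predict` and `sample_predict` are illustrative only.

```python
import random

# Hypothetical toy distribution Pr(y | x) over complete candidate outputs.
# The sentences and probabilities are invented for illustration.
toy_distribution = {
    "The cat sat on the mat.": 0.5,
    "A cat rested on the rug.": 0.3,
    "Upon the mat, a cat reclined.": 0.2,
}


def arg_max_predict(dist):
    """Return the single most probable output (the arg max rule)."""
    return max(dist, key=dist.get)


def sample_predict(dist, rng):
    """Draw one output in proportion to its probability."""
    outputs = list(dist)
    weights = [dist[y] for y in outputs]
    return rng.choices(outputs, weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Apply each rule five times to the same input distribution.
arg_max_outputs = {arg_max_predict(toy_distribution) for _ in range(5)}
sampled_outputs = [sample_predict(toy_distribution, rng) for _ in range(5)]

print("arg max:", arg_max_outputs)
print("sampled:", sampled_outputs)
```

Running the sketch repeatedly on the same distribution may help when weighing how each rule behaves across repeated requests.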

Updated 2025-10-08

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science