True/False

Consider the process of selecting the best output from a set of N candidates, where a reward model r scores each candidate ŷ_i based on an input x. The selection is represented by the formula: ŷ_best = max{r(x, ŷ_1), ..., r(x, ŷ_N)}. This formula implies that the final output, ŷ_best, is a numerical value representing the highest score.

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science