1Cademy - Distinguishing `max` and `argmax` in Candidate Selection

Learn Before

Argmax Formula for Best Candidate Selection in BoN Sampling

Short Answer

Distinguishing max and argmax in Candidate Selection

A language model generates several candidate responses for a given prompt, and a reward model assigns a quality score to each. Explain the key difference between what the max function and the argmax operator would return if applied to this set of scored candidates.

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences