Concept

Top-k Sampling

Top-k sampling is a decoding strategy where, at each step of the text generation process, the next token is selected by sampling from a reduced set of candidates. This set is limited to the 'k' tokens that have the highest predicted probabilities.

Image 0

0

1

Updated 2026-05-05

Tags

Data Science

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Learn After