1Cademy - Defining the Candidate Set in Top-K Decoding

Learn Before

Formula for the Candidate Set in Top-K Decoding

Short Answer

Defining the Candidate Set in Top-K Decoding

A language model is generating a sequence. At step i=4, the shared preceding sequence is ('The', 'cat', 'sat'). The model has identified the top-K next tokens (where K=3) as y_4^top1 = 'on', y_4^top2 = 'by', and y_4^top3 = 'near'. Using standard set notation, write out the complete candidate set, denoted as Y_4.

Updated 2025-10-08

Contributors are:

Who are from:

Learn Before

Related