Learn Before
Next Token Selection in k-NN Language Models
After computing the interpolated final probability distribution, P(y | x), a k-nearest neighbors (k-NN) language model selects the next token, ŷ. This selection is achieved by finding the specific token that maximizes the final probability, ŷ = argmax_y P(y | x), thereby choosing the most probable output based on the combined retrieval and model predictions.
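As a minimal sketch of this selection step (in Python, with an invented λ value and toy probabilities; the vocabulary and numbers below are illustrative, not from the original), the blend-then-argmax logic looks like this:

```python
# Minimal sketch: interpolate a base-LM distribution with a retrieval
# distribution, then greedily pick the argmax token.
# All numbers are invented for illustration.

lam = 0.25  # interpolation coefficient λ (assumed value)

# Hypothetical next-token distributions over a three-word vocabulary.
p_knn = {"cat": 0.6, "dog": 0.3, "fish": 0.1}  # retrieval-based (k-NN)
p_lm = {"cat": 0.1, "dog": 0.5, "fish": 0.4}   # base language model

# P(y | x) = λ * P_knn(y | x) + (1 - λ) * P_lm(y | x)
p_final = {tok: lam * p_knn[tok] + (1 - lam) * p_lm[tok] for tok in p_lm}

# ŷ = argmax_y P(y | x): the token with the highest blended probability.
next_token = max(p_final, key=p_final.get)
print(next_token)  # "dog" for these toy numbers
```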
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Next Token Selection in k-NN Language Models
A language model's final probability for a word is determined by blending its own internal prediction with a prediction based on retrieved text examples. The formula used is:
Final_Prob = λ * Retrieved_Prob + (1 - λ) * Internal_Prob. In a scenario where the model's internal prediction for the next word is 'innovative', but the most frequent word in similar retrieved examples is 'creative', how would the value of the coefficient λ influence the outcome?
Analyzing Component Influence in a k-NN Language Model
In a language model that combines its own predictions with information from retrieved examples using the formula
Final_Prob = λ * Retrieved_Prob + (1 - λ) * Base_LM_Prob, setting the coefficient λ to 0 results in the final prediction being determined entirely by the base language model, while setting λ to 1 determines it entirely by the retrieved examples.
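Both related items above hinge on how λ shifts weight between the two components. A small sketch (with toy probabilities invented for illustration) makes this concrete by sweeping λ from 0 to 1 and watching the prediction flip from the internal choice 'innovative' to the retrieved choice 'creative':

```python
# Sketch: sweeping λ to see the final prediction move between the base
# model's favorite ("innovative") and retrieval's favorite ("creative").
# Probabilities are invented toy values, not real model output.

internal = {"innovative": 0.6, "creative": 0.4}   # base LM prefers 'innovative'
retrieved = {"innovative": 0.3, "creative": 0.7}  # retrieval prefers 'creative'

for lam in (0.0, 0.25, 0.5, 0.75, 1.0):
    final = {w: lam * retrieved[w] + (1 - lam) * internal[w] for w in internal}
    winner = max(final, key=final.get)
    print(f"λ = {lam:.2f} -> {winner}  {final}")

# λ = 0.00 reproduces the base model's prediction ('innovative');
# λ = 1.00 reproduces the retrieval prediction ('creative');
# for these toy numbers the flip happens at λ = 1/3.
```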
Learn After
A language model combines a base probability distribution, P_lm, with a retrieval-based distribution, P_knn, to predict the next token. The final probability is calculated by blending the two distributions as Final_Prob = λ * P_knn + (1 - λ) * P_lm, with interpolation coefficient λ = 0.6. Given the distributions below for a small vocabulary, which token will the model select as its final output?
- P_lm: {'wordA': 0.5, 'wordB': 0.4, 'wordC': 0.1}
- P_knn: {'wordA': 0.2, 'wordB': 0.7, 'wordC': 0.1}
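A quick way to check the answer is to compute the blend directly. The sketch below assumes, consistent with the formulas above, that λ weights the retrieval distribution P_knn:

```python
# Worked check for the exercise: blend P_lm and P_knn with λ = 0.6
# (λ on P_knn, as in the formulas above), then take the argmax.

lam = 0.6
p_lm = {"wordA": 0.5, "wordB": 0.4, "wordC": 0.1}
p_knn = {"wordA": 0.2, "wordB": 0.7, "wordC": 0.1}

p_final = {tok: lam * p_knn[tok] + (1 - lam) * p_lm[tok] for tok in p_lm}
print(p_final)  # ≈ {'wordA': 0.32, 'wordB': 0.58, 'wordC': 0.10}

# wordB has the highest blended probability, so it is selected.
print(max(p_final, key=p_final.get))  # wordB
```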
Analyzing Conflicting Model Predictions
Calculating the Interpolation Coefficient