Learn Before
Using Reference Tokens to Define a Vocabulary Distribution in k-NN LM
A common strategy for using the retrieved reference tokens in k-NN language models is to construct a new probability distribution over the vocabulary. This distribution is derived from the nearest neighbors and guides the model's final prediction by incorporating context from the datastore.
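A minimal sketch of this idea, assuming a softmax over negative distances (one common weighting choice; the function name, `temperature` parameter, and input format are illustrative, not from the text):

```python
import math
from collections import defaultdict

def knn_distribution(neighbors, temperature=1.0):
    """Turn retrieved (token, distance) pairs into a probability
    distribution over the retrieved portion of the vocabulary.

    Each neighbor contributes weight exp(-distance / temperature);
    weights for repeated tokens are summed, then normalized.
    Tokens never retrieved implicitly get probability zero.
    """
    weights = defaultdict(float)
    for token, dist in neighbors:
        weights[token] += math.exp(-dist / temperature)
    total = sum(weights.values())
    return {token: w / total for token, w in weights.items()}
```

Closer neighbors (smaller distances) thus receive larger weight, and a token retrieved several times accumulates probability mass across its occurrences.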
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Role of Internal State in Datastore Search
A language model enhanced with a nearest-neighbor mechanism needs to find relevant information from its external datastore to help predict the next word. Arrange the following steps in the correct chronological order to describe how the model retrieves this information.
A language model enhanced with a nearest-neighbor search mechanism is generating text. The model's current internal state, representing the prefix 'The scientist made a groundbreaking...', is used as a query to search an external datastore. The datastore contains pairs of (context representation, associated word). If the search retrieves the three words 'discovery', 'advance', and 'finding' as reference tokens, which statement most accurately describes how these specific words were selected?
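The retrieval step described in this question can be sketched as a brute-force nearest-neighbor search: the model's internal state is compared against every stored context representation, and the words attached to the closest keys are returned. This is a simplified sketch (real systems typically use an approximate index; the function name and L2 distance choice are assumptions):

```python
import math

def retrieve_neighbors(query, datastore, k=3):
    """Return the k (word, distance) pairs whose stored context
    representations lie closest to the query vector.

    `query` is the model's current internal state; `datastore` is a
    list of (context_vector, word) pairs. Distance here is Euclidean.
    """
    scored = []
    for key, word in datastore:
        dist = math.sqrt(sum((q - x) ** 2 for q, x in zip(query, key)))
        scored.append((word, dist))
    scored.sort(key=lambda pair: pair[1])  # closest first
    return scored[:k]
```

The key point the question tests: words like 'discovery', 'advance', and 'finding' are selected because their stored context vectors are nearest to the query vector, not because of any property of the words themselves.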
Learn After
Aggregated Distance Calculation for k-NN Vocabulary Distribution
Linear Interpolation of k-NN and LLM Distributions
Characterizing a Retrieval-Based Probability Distribution
A k-Nearest Neighbors Language Model (k-NN LM) is generating text and needs to predict the next token. It queries its datastore and retrieves the 5 nearest reference tokens, along with their corresponding distances: {"river": 0.1}, {"stream": 0.2}, {"river": 0.3}, {"ocean": 0.8}, {"river": 0.9}. How are these retrieved tokens and their distances used to construct a new probability distribution over the model's vocabulary?
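The scenario above can be computed explicitly. Assuming neighbors are weighted by exp(-distance) and then normalized (a common choice; the temperature of 1.0 is an assumption), the three occurrences of "river" accumulate weight:

```python
import math

# The five retrieved neighbors and distances from the question.
neighbors = [("river", 0.1), ("stream", 0.2), ("river", 0.3),
             ("ocean", 0.8), ("river", 0.9)]

# Weight each neighbor by exp(-distance); repeated tokens accumulate.
weights = {}
for token, dist in neighbors:
    weights[token] = weights.get(token, 0.0) + math.exp(-dist)

# Normalize into a probability distribution over the retrieved tokens.
total = sum(weights.values())
probs = {token: w / total for token, w in weights.items()}
```

Under this scheme "river" receives the largest share, both because it appears three times and because it includes the single closest neighbor; every vocabulary token not retrieved gets probability zero.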
Evaluating a k-NN LM's Intermediate Output