Analyzing Prediction Outcomes via Neighbor Distances
A language model retrieves the 5 nearest neighbors from its datastore to help predict the next word. The retrieved items, each consisting of a token value and its distance from the current context, are listed below. Based on this information, explain why the model's final prediction is more likely to be 'ocean' than 'sea'. Your explanation must focus on how the aggregated distance for each of these two words is determined from the neighbors, assuming the aggregated distance for a given word is the minimum distance found among the neighbors with that word as their value.
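Since the neighbor list itself is not reproduced here, the sketch below uses hypothetical token values and distances (illustrative assumptions, not the original data) to show how minimum-distance aggregation can make 'ocean' win over 'sea':

```python
import math

# Hypothetical retrieved neighbors: (token value, distance to the query).
# These five entries are assumed for illustration only.
neighbors = [
    ("ocean", 0.1),
    ("sea", 0.4),
    ("ocean", 0.7),
    ("sea", 0.5),
    ("lake", 0.9),
]

# Aggregated distance per word = minimum distance among the neighbors
# whose value is that word.
agg = {}
for value, dist in neighbors:
    agg[value] = min(dist, agg.get(value, float("inf")))

# A smaller aggregated distance means a closer match, so the word with
# the smallest minimum ('ocean' here, 0.1 vs. 0.4) is the likelier
# prediction; a softmax over negative distances makes this concrete.
scores = {w: math.exp(-d) for w, d in agg.items()}
total = sum(scores.values())
probs = {w: s / total for w, s in scores.items()}
print(agg)                           # {'ocean': 0.1, 'sea': 0.4, 'lake': 0.9}
print(probs["ocean"] > probs["sea"])  # True
```

The key point is that a single very close neighbor is enough: even if 'sea' appeared on more neighbors, 'ocean' wins as long as its nearest occurrence is the closest one.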
Calculating Aggregated Distances from Nearest Neighbors
A k-NN Language Model retrieves the 4 nearest neighbors from its datastore for a given query hidden state. The retrieved neighbors, their corresponding token values, and their distances to the query are listed below:
- Neighbor 1: Value = 'cat', Distance = 0.2
- Neighbor 2: Value = 'dog', Distance = 0.3
- Neighbor 3: Value = 'cat', Distance = 0.5
- Neighbor 4: Value = 'fish', Distance = 0.6
Based on this information, what is the aggregated distance for the vocabulary token 'cat'?
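Under the minimum-distance aggregation rule stated above, 'cat' appears on two neighbors with distances 0.2 and 0.5, so its aggregated distance is min(0.2, 0.5) = 0.2. A minimal sketch of the computation:

```python
# Retrieved neighbors from the question: (token value, distance to query).
neighbors = [
    ("cat", 0.2),
    ("dog", 0.3),
    ("cat", 0.5),
    ("fish", 0.6),
]

# Aggregated distance per token = minimum distance among the neighbors
# carrying that token value.
agg = {}
for value, dist in neighbors:
    agg[value] = min(dist, agg.get(value, float("inf")))

print(agg["cat"])  # 0.2 (the smaller of 0.2 and 0.5)
```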