Learn Before
A language model architecture is designed to process a query by using two parallel computational streams: one that computes attention over a local memory of recent context, and another that searches an external datastore for relevant information. Match each architectural component with its primary function in this process.
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Gated Combination of Local and k-NN Attention
An advanced language model is designed to act as a conversational partner while also drawing on a vast external knowledge base. When processing a user's query, the model employs a dual-path architecture:
- One path calculates attention over the recent conversational history (the "local context").
- A parallel path performs a similarity search on the external knowledge base to find the most relevant documents and then calculates attention over the content of those documents.

The outputs from both paths are then integrated to form the final response.
What is the primary architectural advantage of processing local context and retrieved knowledge in two separate, parallel streams?
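The dual-path design described above can be sketched in code. The following is a minimal, illustrative NumPy sketch (not the implementation of any specific published model): one path runs scaled dot-product attention over the local context, a parallel path performs a k-NN similarity search over an external datastore and attends only over the retrieved entries, and a learned sigmoid gate blends the two streams. The function names (`attention`, `gated_dual_path`) and the scalar `gate_logit` parameter are hypothetical choices for this sketch.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Scaled dot-product attention for a single query vector q.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores) @ v

def gated_dual_path(q, local_k, local_v, store_k, store_v,
                    gate_logit, top_k=2):
    # Path 1: attention over the local context window.
    local_out = attention(q, local_k, local_v)

    # Path 2: k-NN similarity search over the external datastore,
    # then attention over only the retrieved neighbours.
    sims = store_k @ q                      # similarity of q to every stored key
    nn = np.argsort(-sims)[:top_k]          # indices of the top-k neighbours
    knn_out = attention(q, store_k[nn], store_v[nn])

    # A learned gate (here a single logit for illustration) blends
    # the two streams into one output.
    g = 1.0 / (1.0 + np.exp(-gate_logit))   # sigmoid
    return g * local_out + (1.0 - g) * knn_out
```

Because the two streams stay separate until the final gated sum, the retrieval path can be scaled, updated, or swapped out without retraining the local-attention path, which is one way to frame the "primary architectural advantage" the question asks about.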
Architectural Solution for Long-Term Context