Concept

Retrieval-Based Methods as a Solution for Long-Context Processing

To overcome the difficulty of training Transformers on very long sequences, non-parametric approaches like retrieval-based methods can be used. These methods leverage an external memory, such as a vector database of key-value pairs, to represent the context, thereby avoiding the need to process the entire long context directly within the Transformer architecture.

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences