Concept

Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Generation (RAG) is a context scaling approach that dynamically incorporates timely or specialized external knowledge into a large language model's input. RAG systems first retrieve relevant document snippets from a large collection or database based on the current query. These retrieved pieces of information are then added to the prompt context, grounding the model's predictions in specific external sources to generate responses that are both relevant to the input and factually up-to-date.

Image 0

0

1

Updated 2026-05-06

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.3 Prompting - Foundations of Large Language Models

Ch.5 Inference - Foundations of Large Language Models

Related
Learn After