Example

Needle-in-a-Haystack and Passkey Retrieval Tasks

The 'needle-in-a-haystack' and passkey retrieval tasks are synthetic evaluation methods that assess an LLM's ability to retrieve information from long contexts. The model is tasked with identifying and extracting a small, relevant piece of information that is intentionally hidden within a large volume of irrelevant text. The core assumption tested is that a model with effective long-context memory can remember details from early in the text while processing subsequent information, enabling it to locate sparse details.

0

1

Updated 2026-04-29

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models