Designing a Long-Context Retrieval Experiment
You are tasked with comparing the long-context retrieval capabilities of two new large language models, Model A and Model B. Design an experiment using the 'needle-in-a-haystack' methodology to determine which model performs better. Your experimental design should describe:
- The structure of the input documents you would create (the 'haystack').
- The specific piece of information you would embed (the 'needle').
- The prompt you would use to query the models.
- The key metric(s) you would use to measure and compare their performance across multiple trials.
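The sketch below illustrates one way such a harness could be put together. It is a minimal, illustrative sketch, not a prescribed solution: the `query_model` helper is a hypothetical stand-in for whatever inference client you actually use, and the filler text, needle fact, depth grid, and exact-match scoring are placeholder choices.

```python
"""Minimal needle-in-a-haystack harness (illustrative sketch).

Assumptions not taken from the exercise: query_model() is a hypothetical
stand-in for a real inference client; the needle, filler text, context
lengths, and depths are placeholders.
"""

NEEDLE = "The access code for the archive room is 7402-KDX."
QUESTION = "What is the access code for the archive room? Answer with the code only."
FILLER = "This paragraph discusses routine, unrelated operational details. "


def build_haystack(total_sentences: int, needle_depth: float) -> str:
    """Assemble a long distractor document with the needle inserted at a
    relative depth (0.0 = start of document, 1.0 = end)."""
    sentences = [FILLER] * total_sentences
    idx = int(needle_depth * (total_sentences - 1))
    sentences.insert(idx, NEEDLE + " ")
    return "".join(sentences)


def query_model(model_name: str, prompt: str) -> str:
    """Hypothetical inference call -- replace with your own API client."""
    raise NotImplementedError("Wire this to the model under test.")


def run_trials(model_name: str, depths, lengths, trials_per_cell: int = 3):
    """Return retrieval accuracy for each (context length, needle depth) cell."""
    results = {}
    for n_sentences in lengths:
        for depth in depths:
            hits = 0
            for _ in range(trials_per_cell):
                haystack = build_haystack(n_sentences, depth)
                prompt = f"{haystack}\n\n{QUESTION}"
                answer = query_model(model_name, prompt)
                hits += int("7402-KDX" in answer)  # exact-match scoring
            results[(n_sentences, depth)] = hits / trials_per_cell
    return results


# Example sweep over both candidate models (uncomment once query_model is wired up):
# depths = [0.0, 0.25, 0.5, 0.75, 1.0]
# lengths = [500, 2000, 8000]
# for model in ("model-a", "model-b"):
#     print(model, run_trials(model, depths, lengths))
```

Averaging the per-cell accuracies, or plotting them as a context-length-by-depth heatmap, gives a direct side-by-side comparison of the two models and makes position-dependent failures (such as a drop in the middle of the context) easy to spot.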
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Creation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An AI research team is testing a new large language model's long-context capabilities. They create a test where a unique, non-obvious fact ('The most common color for a fire hydrant in Iceland is bright yellow') is inserted into different locations within a very long, unrelated document. The model is then prompted to retrieve this specific fact. The team observes that the model successfully retrieves the fact when it's placed near the beginning or the end of the document, but consistently fails to retrieve it when it's placed in the middle sections. What does this experimental result most strongly suggest about the model's performance?
Critique of a Synthetic Retrieval Task
Designing a Long-Context Retrieval Experiment
You are evaluating two candidate long-context LLMs...
You lead evaluation for an internal eDiscovery ass...
Your team is writing an internal evaluation checkl...
Your team is selecting an LLM for an internal "pol...
Selecting a Long-Context LLM for a Cost-Constrained Enterprise Document Assistant
Choosing Long-Context Evaluation Evidence for a High-Volume Contract Review Feature
Designing an Evaluation Plan for a Long-Context Compliance Copilot Under Latency and Cost Constraints
Reconciling Long-Context Retrieval Quality with Inference Efficiency for a Meeting-Transcript Copilot
Evaluating a Long-Context LLM for Audit-Ready Evidence Retrieval Under Throughput Constraints
Diagnosing Conflicting Long-Context Evaluation Signals for an Internal Knowledge Assistant