Dataset

SCAN Benchmark

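The SCAN (Simplified version of the CommAI Navigation tasks) benchmark is a set of tasks created to test a model's capacity for compositional generalization, that is, its ability to interpret new combinations of components it has already seen. Each task requires the model, such as a Large Language Model, to translate a natural language instruction into the corresponding sequence of actions. For example, the instruction "jump twice" corresponds to performing the jump action two times in a row.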
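To make the instruction-to-action mapping concrete, here is a minimal Python sketch of an interpreter for a small subset of SCAN-style commands. It is an illustration under stated assumptions, not the benchmark's reference implementation: the function name `interpret` is hypothetical, the action token names (`JUMP`, `LTURN`, and so on) are illustrative and may be spelled differently in the released data, and modifiers such as "opposite" and "around" are left out.

```python
# Illustrative action tokens (assumption: the released data may spell these differently).
PRIMITIVES = {"walk": "WALK", "look": "LOOK", "run": "RUN", "jump": "JUMP"}
TURNS = {"left": "LTURN", "right": "RTURN"}
REPEATS = {"twice": 2, "thrice": 3}


def interpret(command):
    """Translate a SCAN-style command (subset only) into a list of action tokens."""
    words = command.split()
    if not words:
        raise ValueError("empty command")

    # Conjunctions bind most loosely: "x and y" runs x then y,
    # while "x after y" runs y first and then x.
    for conj in ("and", "after"):
        if conj in words:
            i = words.index(conj)
            left = interpret(" ".join(words[:i]))
            right = interpret(" ".join(words[i + 1:]))
            return right + left if conj == "after" else left + right

    # Repetition applies to everything before it: "x twice" -> x x.
    if words[-1] in REPEATS:
        return interpret(" ".join(words[:-1])) * REPEATS[words[-1]]

    # Direction: "turn left" is just the turn; "walk left" is turn, then walk.
    if len(words) == 2 and words[1] in TURNS:
        turn = [TURNS[words[1]]]
        if words[0] == "turn":
            return turn
        if words[0] in PRIMITIVES:
            return turn + [PRIMITIVES[words[0]]]

    # A bare primitive action.
    if len(words) == 1 and words[0] in PRIMITIVES:
        return [PRIMITIVES[words[0]]]

    raise ValueError(f"unsupported command: {command!r}")


print(interpret("jump twice"))
print(interpret("walk left and run thrice"))
```

A model that generalizes compositionally should handle a command such as "jump twice" even if it has only seen "jump" on its own and "twice" applied to other actions. The rule-based sketch does this trivially, which is exactly the behavior the benchmark probes for in learned models.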

Updated 2025-10-04

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences