1Cademy - SCAN Benchmark

Learn Before

Compositional Reasoning Tasks for LLMs

Dataset

SCAN Benchmark

The SCAN (Simplified versions of the CommAI Navigation tasks) benchmark is a set of tasks created to test a Large Language Model's capacity for compositional generalization. These tasks require the model to translate natural language instructions into a corresponding sequence of actions.

Updated 2025-10-04

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Example of a SCAN Task

Learn Before

Related

Learn After