Analyzing a Model's Command Interpretation Failure
A language model is trained to convert simple commands into action sequences. During training, it correctly processes commands such as 'jump', 'run', 'jump twice', and 'run thrice'. When tested on 'run twice', a command it has never seen, it produces the incorrect sequence 'RUN, JUMP, JUMP'. Based on this information, analyze the most probable cause of this specific failure.
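To make the expected behavior concrete, the following minimal Python sketch spells out the compositional rule the model should have induced. The vocabulary and grammar here are assumptions drawn from the commands in the question, not the exact specification of any particular benchmark.

# Assumed command grammar: <verb> [<repetition modifier>]
PRIMITIVES = {"walk": "WALK", "run": "RUN", "jump": "JUMP"}
REPETITIONS = {"twice": 2, "thrice": 3}

def interpret(command: str) -> list[str]:
    """Map a command such as 'run twice' to its action sequence."""
    tokens = command.split()
    action = PRIMITIVES[tokens[0]]  # primitive verb -> action token
    count = REPETITIONS[tokens[1]] if len(tokens) > 1 else 1
    return [action] * count  # the modifier applies to any verb

print(interpret("jump twice"))  # ['JUMP', 'JUMP'] (seen during training)
print(interpret("run twice"))   # ['RUN', 'RUN'] (the correct held-out answer)

Measured against this rule, the model's output 'RUN, JUMP, JUMP' suggests it bound the modifier 'twice' to the specific verb 'jump' it co-occurred with in training, rather than learning 'twice' as a verb-independent operator; in other words, a failure of compositional generalization.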
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
SCAN Tasks for Evaluating Compositional Generalization
Analyzing a Model's Command Interpretation Failure
A language model is trained on a dataset of simple commands. It successfully learns to execute individual actions such as 'walk', 'run', and 'jump', and it learns to apply the modifier 'twice' to 'run', correctly executing 'run twice'. However, when presented with the novel command 'jump twice', the model fails to produce the correct action sequence. This failure demonstrates a specific weakness in the model's capacity for:
Evaluating Evidence of Generalization
Analyzing Model Performance on Novel Instructions