Case Study

Analyzing Model Performance on Unseen Sequence Lengths

A team of developers trains a sequence model to predict the next number in a series. The model is trained exclusively on sequences that follow a simple increasing pattern, with a maximum length of 500 tokens. During testing, the model performs with near-perfect accuracy on new sequences up to 500 tokens long. However, when tested on a sequence of 750 tokens that follows the same increasing pattern, the model's predictions become erratic and incorrect after the 500th token. Based on this scenario, diagnose the specific type of generalization failure the model is exhibiting and explain why its performance degraded.

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science