Learn Before
Short Answer

Analyzing Model Performance Scaling

Imagine you are analyzing the performance of two different text-processing architectures, Model A and Model B. You plot their processing time against the length of the input text. The resulting graph shows that for short texts (under 500 tokens), Model A is slightly faster. However, as the text length increases to several thousand tokens, Model B's processing time increases dramatically and becomes much slower than Model A's, whose processing time increases at a much steadier, slower rate. Based on these performance characteristics, which model (A or B) likely employs an architecture that scales linearly with sequence length? Justify your answer by explaining the relationship between the observed performance and the underlying computational complexity.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science