Multiple Choice

A team is comparing two text generation systems to produce a 10-token sequence.

  • System A generates tokens one after another. The computation for each token takes 100ms.
  • System B is a hypothetical system that can compute all 10 tokens simultaneously, with each token's computation also taking 100ms.

Why does System A take approximately 10 times longer than System B to produce the full sequence?

0

1

Updated 2025-10-05

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science