Comparison

Simultaneous vs. Sequential Phases in Continuous and Standard Batching

A key difference between continuous and standard batching methods lies in how they execute the prefilling and decoding phases. In continuous batching, prefilling and decoding can occur simultaneously across different sequences within the active batch. Conversely, in standard batching, these two phases must be performed sequentially for the entire batch before moving on.

0

1

Updated 2026-05-06

Contributors are:

Who are from:

Tags

Foundations of Large Language Models

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences