Concept

Overhead of Dynamic Batch Reorganization in Continuous Batching

A significant trade-off in continuous batching is the overhead that comes from its dynamic nature. Whenever a request is added to or removed from the batch, the scheduler must reorganize the batch, which involves rearranging data in memory. This repeated reassessment of the batch structure costs both computation and memory. The consequences can include increased memory fragmentation and, in some situations, additional processing latency, either of which can counteract the throughput gains.
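The reorganization cost described above can be made concrete with a toy simulation. The sketch below is only an illustration under stated assumptions: the batch is modeled as a contiguous list of slots, a finished request leaves a hole, and every request behind the hole is shifted forward, with the shift counted as copying that request's per-request state. The names `Request`, `run_continuous_batch`, and `tokens_copied` are hypothetical and do not come from any particular inference engine; real schedulers may lay out memory differently and avoid or reduce such copies.

```python
from dataclasses import dataclass


@dataclass
class Request:
    req_id: int
    remaining: int     # decode steps left before the request finishes
    cache_tokens: int  # tokens of per-request state held in the batch


def run_continuous_batch(arrivals, max_batch):
    """Simulate decode iterations and return (steps, tokens_copied).

    arrivals: list of (prompt_tokens, decode_steps) in FIFO order.
    tokens_copied counts the state moved when the batch is compacted
    (an assumed cost model, not a measurement of any real system).
    """
    queue = [Request(i, steps, prompt)
             for i, (prompt, steps) in enumerate(arrivals)]
    batch = []
    steps = 0
    tokens_copied = 0
    while queue or batch:
        # Admit waiting requests into free slots at the end of the batch.
        while queue and len(batch) < max_batch:
            batch.append(queue.pop(0))
        # One decode iteration: each active request produces one token.
        for r in batch:
            r.remaining -= 1
            r.cache_tokens += 1
        steps += 1
        # Drop finished requests and compact the batch: every survivor
        # that sat behind a finished request is shifted forward.
        survivors = []
        hole_seen = False
        for r in batch:
            if r.remaining <= 0:
                hole_seen = True
            else:
                if hole_seen:
                    tokens_copied += r.cache_tokens
                survivors.append(r)
        batch = survivors
    return steps, tokens_copied


# Same three requests, different slot order.
short_first = run_continuous_batch([(4, 1), (4, 3), (4, 2)], max_batch=3)
long_first = run_continuous_batch([(4, 3), (4, 2), (4, 1)], max_batch=3)
print(short_first, long_first)
```

In this model both runs perform the same number of decode iterations, but the run in which the shortest request occupies the first slot pays for moving the state of the two requests behind it, while the run in which requests finish from the back of the batch pays nothing. The amount of reorganization therefore depends on which requests finish and where they sit, not only on how much decoding is done.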


Updated 2026-05-06


Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
