Activity (Process)

Initial Batch Creation in Continuous Batching

The continuous batching process begins with the creation of an initial batch. This batch is assembled from one or more input sequences, with its size and composition determined by the inference engine's available processing capacity and the current queue of user requests. After formation, this batch is dispatched to the inference engine to begin processing.

0

1

Updated 2026-05-06

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences