Example

Example of Initial Batch Creation in Continuous Batching

An illustration of the initial step in continuous batching involves the scheduler receiving new requests, such as 'x1' and 'x2'. In the first iteration of the process, these requests are formed into an initial batch and sent to the inference engine to begin the prefilling phase.

0

1

Updated 2025-10-09

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences