Example of the Second Decoding Step in Continuous Batching (Iteration 3)
This diagram illustrates the third iteration in the continuous batching example, continuing from the first decoding step. In this stage, the scheduler again directs the inference engine to perform a single decoding operation over the batch containing requests x1 and x2, generating the second output token for each request. This demonstrates the iterative nature of the decoding phase: each iteration produces exactly one new token per active request in the batch.
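The loop described above can be sketched in a few lines of Python. This is a minimal, hypothetical illustration (the `Request` class, `decode_step` function, and placeholder tokens are invented for this sketch, not taken from any real inference engine): each call to `decode_step` represents one batched decoding iteration in which every active request gains one output token, and finished requests leave the batch, which is what makes the batching "continuous".

```python
from dataclasses import dataclass, field

@dataclass
class Request:
    """A generation request tracked by the scheduler (hypothetical structure)."""
    rid: str
    prompt: str
    tokens: list = field(default_factory=list)  # output tokens generated so far
    max_tokens: int = 4                         # stop after this many tokens

def decode_step(batch):
    """One batched decoding iteration: every active request produces exactly
    one new output token. A real engine would run a single forward pass over
    the whole batch; here a placeholder token stands in for the model output."""
    for req in batch:
        req.tokens.append(f"tok{len(req.tokens) + 1}")
    # Finished requests leave the batch, freeing slots for newly arriving
    # requests -- the defining behavior of continuous batching.
    return [r for r in batch if len(r.tokens) < r.max_tokens]

# Iterations 2 and 3 from the example: two decode steps for x1 and x2.
batch = [Request("x1", "..."), Request("x2", "...")]
batch = decode_step(batch)  # first decoding step: first output token each
batch = decode_step(batch)  # second decoding step (iteration 3): second token each
print([(r.rid, r.tokens) for r in batch])
```

After the second call, both requests hold exactly two output tokens, matching the state shown in the diagram for iteration 3.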
