Activity (Process)

Removing Completed Sequences in Continuous Batching

A key dynamic adjustment made by the scheduler in continuous batching is the removal of completed sequences from the active batch. Once a sequence finishes its generation, typically signaled by an end-of-sequence token, it is immediately removed. This action, performed between iterations, frees up computational resources for new or ongoing requests.

0

1

Updated 2026-05-06

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences