Learn Before
Concept

Increased Scheduling Complexity in Chunked Prefilling

A significant trade-off of chunked prefilling is the introduction of greater scheduling complexity. Unlike standard prefilling where an entire sequence is a single task, the chunk-based approach requires the scheduler to manage a larger number of smaller, more granular tasks for each sequence. This adds computational overhead to the scheduling process.

0

1

Updated 2026-05-06

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences