An LLM inference system is processing a batch of user requests. An observer notes the following: At the start of one processing step, the active batch contains requests {A, B, C, D}. Immediately before the next processing step begins, the active batch contains requests {A, C, E}. Based on this observation, what is the most fundamental principle of this system's batch management strategy?
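The observed transition is the signature of continuous (iteration-level) batching: B and D completed and were evicted, and E was admitted from the waiting queue, all between two decode steps. Below is a minimal, engine-agnostic sketch of that transition; `decode_step`, `MAX_BATCH`, and the waiting queue are illustrative assumptions, not any specific engine's API.

```python
from collections import deque

MAX_BATCH = 4

def decode_step(batch):
    """Stand-in for one forward pass over the whole batch; returns the
    requests that emitted an end-of-sequence token during this step."""
    return {r for r in batch if r in {"B", "D"}}  # hypothetical: B and D finish here

active = ["A", "B", "C", "D"]   # batch at the start of the observed step
waiting = deque(["E"])          # one request waiting for admission

finished = decode_step(active)                      # run one decode iteration
active = [r for r in active if r not in finished]   # evict completed sequences
while waiting and len(active) < MAX_BATCH:          # backfill freed slots immediately
    active.append(waiting.popleft())

print(active)  # ['A', 'C', 'E'] -- the batch seen just before the next step
```

Backfilling freed slots as soon as they open, rather than waiting for the entire batch to drain, is what distinguishes this strategy from static batching.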
Tags
Ch.5 Inference - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Related
Removing Completed Sequences in Continuous Batching
Adding New Requests in Continuous Batching
Maintaining an Unchanged Batch in Continuous Batching
Overhead of Dynamic Batch Reorganization in Continuous Batching
Inference Batch Management Scenario
An LLM inference engine processes requests in iterative cycles. Arrange the following events to show the correct sequence for a single cycle where the active batch of requests is modified.
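For context, a single modification cycle typically orders the events as: run one decode step over the current batch, remove the sequences that completed, admit waiting requests into the freed slots, then begin the next step. A short sketch under those assumptions follows; all function and variable names are illustrative, not a particular engine's interface.

```python
from collections import deque

def run_one_cycle(active, waiting, max_batch, step_fn):
    """One iteration-level batching cycle; the comments number the events
    in the order they occur."""
    # 1. Execute one decode step for every sequence in the current batch.
    finished = step_fn(active)
    # 2. Remove the sequences that completed during this step.
    active = [r for r in active if r not in finished]
    # 3. Admit waiting requests into the slots that were just freed.
    while waiting and len(active) < max_batch:
        active.append(waiting.popleft())
    # 4. Return the reorganized batch for the next cycle to process.
    return active

# Reproduces the scenario above: B and D finish, E is admitted.
batch = run_one_cycle(["A", "B", "C", "D"], deque(["E"]), 4,
                      step_fn=lambda b: {"B", "D"})
print(batch)  # ['A', 'C', 'E']
```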