Learn Before
LLM Inference Request Processing
After an inference system has completed a total of 12 computational iterations since the requests arrived, what is the status of each request? Specifically, state whether each request is complete and, if not, how many output tokens have been generated for it. Justify your answer using how iterations are defined for the input-processing and output-generation phases.
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A large language model using a continuous batching inference system processes a single request. The input prompt consists of 150 tokens, and the model is configured to generate an output of 200 tokens. How many computational iterations are required to fully process this single request?
LLM Inference Request Processing
In a continuous batching system for large language model inference, each token processed, whether it belongs to the input prompt or to the generated output, counts as one separate computational iteration. Under this definition, the total number of iterations needed to fully process a request equals its input token count plus its output token count.
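Under this per-token definition of an iteration, the count for a single request can be sketched as follows (a minimal illustration; the function name is our own, not part of the course material):

```python
def iterations_required(input_tokens: int, output_tokens: int) -> int:
    """Total iterations for one request: one iteration per token,
    counting both prompt (input) and generated (output) tokens."""
    return input_tokens + output_tokens

# The related question's request: a 150-token prompt and 200 output tokens.
print(iterations_required(150, 200))  # 350
```

For the related question above, this gives 150 + 200 = 350 iterations to fully process the single request.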