Multiple Choice

An inference server needs to process 12 independent user requests. The server's hardware has two processing options:

  1. Sequential Processing: Handle one request at a time, with each request taking 2 seconds to complete.
  2. Batched Processing: Group requests into batches of 4 and process each batch in parallel, with each batch taking 3 seconds to complete.

Based on this information, which statement correctly analyzes the total time required and the resulting efficiency of each approach?
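
The arithmetic behind the comparison can be sketched as follows. This is a minimal illustration, not part of the original question: it assumes batches run one after another with no overhead between them, and that all 12 requests are available at the start. The function and variable names are hypothetical.

```python
import math


def sequential_time(num_requests: int, seconds_per_request: float) -> float:
    """Total time when requests are handled one at a time."""
    return num_requests * seconds_per_request


def batched_time(num_requests: int, batch_size: int, seconds_per_batch: float) -> float:
    """Total time when batches run back to back (assumed: no gap between batches)."""
    num_batches = math.ceil(num_requests / batch_size)
    return num_batches * seconds_per_batch


NUM_REQUESTS = 12

seq_total = sequential_time(NUM_REQUESTS, seconds_per_request=2)             # 12 * 2 = 24 s
bat_total = batched_time(NUM_REQUESTS, batch_size=4, seconds_per_batch=3)    # 3 batches * 3 = 9 s

# Throughput in requests per second for each approach.
seq_throughput = NUM_REQUESTS / seq_total
bat_throughput = NUM_REQUESTS / bat_total

# How many times faster the batched approach finishes the whole workload.
speedup = seq_total / bat_total

print(f"sequential: {seq_total} s total, {seq_throughput:.2f} req/s")
print(f"batched:    {bat_total} s total, {bat_throughput:.2f} req/s")
print(f"speedup:    {speedup:.2f}x")
```

Under these assumptions, a single batch takes longer than a single sequential request (3 seconds versus 2 seconds), yet the batched approach finishes all 12 requests sooner because each 3-second batch completes 4 requests.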


Updated 2025-10-07


Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science