Definition

Throughput

Throughput is an efficiency metric that measures the processing capacity of a large language model (LLM) during inference. It is typically reported as the number of tokens processed, or requests completed, per second.
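As a minimal sketch of how the two common forms of the metric can be computed, the Python below divides a token count and a request count by the elapsed wall-clock time. The names throughput and measure, and the generate callable (assumed to return the list of tokens produced for one prompt), are hypothetical and chosen only for illustration; they are not part of any particular library.

```python
import time
from typing import Callable, Sequence, Tuple


def throughput(total_tokens: int, num_requests: int,
               elapsed_seconds: float) -> Tuple[float, float]:
    """Return (tokens per second, requests per second)."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return total_tokens / elapsed_seconds, num_requests / elapsed_seconds


def measure(generate: Callable[[str], Sequence[int]],
            prompts: Sequence[str]) -> Tuple[float, float]:
    """Time a hypothetical `generate` callable over a batch of prompts.

    `generate` is assumed to return the tokens it produced for one prompt.
    """
    start = time.perf_counter()
    total_tokens = sum(len(generate(p)) for p in prompts)
    elapsed = time.perf_counter() - start
    return throughput(total_tokens, len(prompts), elapsed)


if __name__ == "__main__":
    # 8 requests producing 1200 tokens in 4 seconds.
    tokens_per_s, requests_per_s = throughput(1200, 8, 4.0)
    print(tokens_per_s, requests_per_s)  # prints 300.0 2.0
```

The two numbers can diverge: a system serving a few long responses may show high tokens per second but low requests per second, so it helps to state which form is being reported.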

Updated 2026-05-05

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related