Definition

Tokens Per Second (TPS)

Tokens Per Second (TPS) is an inference efficiency metric that measures a model's generation rate: the number of tokens the model produces divided by the time, in seconds, it takes to produce them.
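As a minimal sketch of the definition above, TPS can be computed by timing a generation call and dividing the token count by the elapsed wall-clock time. The names `tokens_per_second`, `measure_tps`, and the `generate` callable are hypothetical stand-ins for illustration, not part of any particular library; a real measurement would wrap an actual model's generation call.

```python
import time
from typing import Callable, List


def tokens_per_second(num_tokens: int, elapsed_seconds: float) -> float:
    """Return the generation rate: tokens produced divided by seconds elapsed."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_tokens / elapsed_seconds


def measure_tps(generate: Callable[[], List[str]]) -> float:
    """Time one call to a (hypothetical) generate function and return its TPS.

    `generate` is assumed to return the list of tokens it produced.
    """
    start = time.perf_counter()            # high-resolution wall-clock timer
    tokens = generate()
    elapsed = time.perf_counter() - start
    return tokens_per_second(len(tokens), elapsed)
```

For example, `tokens_per_second(100, 4.0)` returns `25.0`: a model that emits 100 tokens in 4 seconds runs at 25 TPS. In practice the measured value depends on what the timer covers (for instance, whether prompt processing is included), so the timing boundaries should be stated alongside any reported figure.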

Updated 2026-05-05

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
