Learn Before
  • Efficiency Metrics for LLM Evaluation

Time to First Token (TTFT)

Time to First Token (TTFT) is an efficiency metric that measures the duration from when a request is sent to an LLM to when the first token of the response is generated. When data transmission time is minimal, TTFT primarily reflects the time required for prefilling the context and predicting the initial token.
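The measurement described above can be sketched in a few lines of Python. This is a minimal illustration, not a benchmark harness: `simulated_model` is a hypothetical stand-in for a streaming LLM endpoint, with `prefill_delay` modeling context prefill plus first-token prediction and `itl` modeling the gap between subsequent tokens.

```python
import time
from typing import Iterator, List, Tuple


def measure_ttft(token_stream: Iterator[str]) -> Tuple[float, List[str]]:
    """Time from starting to consume a streaming response until the
    first token arrives; also collects the full token list."""
    start = time.perf_counter()
    ttft = 0.0
    tokens: List[str] = []
    for token in token_stream:
        if not tokens:
            # First token just arrived: this interval is the TTFT.
            ttft = time.perf_counter() - start
        tokens.append(token)
    return ttft, tokens


def simulated_model(prefill_delay: float, output: List[str],
                    itl: float) -> Iterator[str]:
    """Toy generator standing in for an LLM: pause once for 'prefill',
    then stream tokens with a fixed inter-token delay."""
    time.sleep(prefill_delay)   # context prefill + initial token prediction
    for t in output:
        yield t
        time.sleep(itl)         # inter-token latency


ttft, out = measure_ttft(simulated_model(0.05, ["Hello", ",", " world"], 0.01))
print(f"TTFT: {ttft * 1000:.1f} ms, tokens: {out}")
```

Because the generator body does not run until iteration begins, the timer starts just before the simulated prefill, so the reported TTFT is approximately `prefill_delay` — the subsequent inter-token delays contribute only to total response time, not to TTFT.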


Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related
  • Request Latency

  • Throughput

  • Time to First Token (TTFT)

  • Inter-token Latency (ITL)

  • Tokens Per Second (TPS)

  • Resource Utilization in LLM Inference

  • Energy Efficiency in LLM Inference

  • Cost Efficiency in LLM Inference

  • A startup is building a real-time, interactive chatbot to help customers troubleshoot technical issues. Their engineering team evaluates two different language models, 'Model X' and 'Model Y'. The team's final report concludes that Model X is superior because its responses are consistently more accurate and helpful across a wide range of test queries. Based on this report, the company decides to deploy Model X. Which of the following statements identifies the most critical potential weakness in the team's evaluation process for this specific use case?

  • LLM Selection for a High-Volume Chatbot

  • A team is evaluating a large language model for deployment. Match each evaluation goal below to the primary category of metric it represents: 'Output Quality' or 'Efficiency'.

  • You are evaluating two candidate long-context LLMs...

  • You lead evaluation for an internal eDiscovery ass...

  • Your team is writing an internal evaluation checkl...

  • Your team is selecting an LLM for an internal "pol...

  • Selecting a Long-Context LLM for a Cost-Constrained Enterprise Document Assistant

  • Choosing Long-Context Evaluation Evidence for a High-Volume Contract Review Feature

  • Designing an Evaluation Plan for a Long-Context Compliance Copilot Under Latency and Cost Constraints

  • Reconciling Long-Context Retrieval Quality with Inference Efficiency for a Meeting-Transcript Copilot

  • Evaluating a Long-Context LLM for Audit-Ready Evidence Retrieval Under Throughput Constraints

  • Diagnosing Conflicting Long-Context Evaluation Signals for an Internal Knowledge Assistant

Learn After
  • Analyzing Chatbot Response Latency

  • A company is developing a conversational AI for a customer service chatbot. User testing reveals that customers perceive the chatbot as 'slow' or 'unresponsive' primarily due to the noticeable pause between them sending a message and the chatbot starting to type its reply. To directly address this specific user perception issue, which efficiency metric should the engineering team focus on minimizing?

  • A user reports that a chatbot application feels very responsive because it begins generating its answer almost instantly. Based on this observation alone, it is valid to conclude that the underlying language model is also highly efficient at generating long, multi-paragraph responses.