1Cademy - End-of-Sequence (EOS) Token as a Stopping Criterion

Learn Before

Stopping Criteria in LLM Inference

Concept

End-of-Sequence (EOS) Token as a Stopping Criterion

A common and straightforward stopping strategy in LLM inference is to terminate generation as soon as the model produces a special end-of-sequence (EOS) token, such as ⟨EOS⟩ or ⟨/s⟩. Models are specifically trained to output this token when the generated text is complete, so its appearance tells the decoding loop that it can stop.
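The stopping rule can be illustrated with a minimal sketch of a decoding loop. Everything named here is hypothetical and chosen only for illustration: the `generate` function, the `toy_model` stand-in for a real LLM, the tiny `VOCAB`, and the choice of id 0 for the EOS token. A real model would return a probability distribution over tokens rather than a single id.

```python
from typing import Callable, List

EOS_ID = 0  # assumed id of the EOS token in this toy vocabulary


def generate(
    next_token: Callable[[List[int]], int],
    prompt: List[int],
    max_new_tokens: int = 100,
    eos_id: int = EOS_ID,
) -> List[int]:
    """Generate tokens until EOS is produced or the token budget runs out."""
    output: List[int] = []
    for _ in range(max_new_tokens):
        token = next_token(prompt + output)
        if token == eos_id:
            break  # EOS stopping criterion: end generation here
        output.append(token)
    return output


# Toy vocabulary and "model" (assumptions, not a real tokenizer or LLM).
VOCAB = {0: "<EOS>", 1: "The", 2: "capital", 3: "of", 4: "France", 5: "is", 6: "Paris"}


def toy_model(context: List[int]) -> int:
    # Emits "Paris" right after "is", and EOS in every other case.
    return 6 if context[-1] == 5 else EOS_ID


prompt = [1, 2, 3, 4, 5]  # "The capital of France is"
completion = generate(toy_model, prompt)
print([VOCAB[t] for t in completion])
```

In this sketch the loop ends after one generated token even though `max_new_tokens` is 100, because the EOS check takes effect before the length limit is reached. The length limit only matters when the model never emits EOS.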

Updated 2026-05-05

References

Reference of Foundations of Large Language Models Course

Learn After

A developer is using a text-generation model to complete the sentence: 'The capital of France is'. The model produces the single word 'Paris' and then immediately stops. The developer had configured the generation process to allow for a maximum of 100 new words and is surprised by the short output. Based on how these models are trained to signal completeness, what is the most likely reason the generation process terminated after just one word?
Consequences of Training Data Omissions
Debugging Premature Text Generation Termination
