1Cademy - Most Likely Sequence in Sequence-to-Sequence Models

Learn Before

Sequence-to-Sequence Learning

Formula

Most Likely Sequence in Sequence-to-Sequence Models

In sequence generation tasks, the objective is typically to find the single most likely output sequence, which can differ from the sequence obtained by greedily selecting the most likely token at each individual step. If a sequence-to-sequence decoder accurately reflects the underlying generative process, the most likely translation is the complete sequence $y_1, \ldots, y_{T'}$ that maximizes the product of conditional probabilities over all time steps: $\prod_{t'=1}^{T'} P(y_{t'} \mid y_1, \ldots, y_{t'-1}, \mathbf{c})$, where $\mathbf{c}$ is the context vector. The sequence that maximizes this product is globally optimal with respect to the model's learned probability distribution.
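To make the gap between the two choices concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the two-token vocabulary, the fixed output length `T`, and the hand-written table `COND` of conditional probabilities, which stands in for a trained decoder already conditioned on the context vector $\mathbf{c}$. It scores every candidate sequence by the product above and compares the maximizer with the sequence built by picking the most likely token at each step.

```python
import itertools

VOCAB = ("a", "b")  # hypothetical two-token vocabulary
T = 2               # hypothetical fixed output length T'

# Hypothetical conditionals P(y_t | y_1, ..., y_{t-1}, c), keyed by the prefix.
# The context vector c is implicit: the table plays the role of a decoder
# that has already been conditioned on it.
COND = {
    (): {"a": 0.6, "b": 0.4},
    ("a",): {"a": 0.55, "b": 0.45},
    ("b",): {"a": 0.9, "b": 0.1},
}


def sequence_prob(seq):
    """Product of the conditional probabilities over all time steps."""
    prob = 1.0
    for t, token in enumerate(seq):
        prob *= COND[tuple(seq[:t])][token]
    return prob


def greedy_sequence():
    """Pick the most likely token at each step, given the prefix so far."""
    seq = ()
    for _ in range(T):
        dist = COND[seq]
        seq += (max(dist, key=dist.get),)
    return seq


def most_likely_sequence():
    """Score every length-T sequence and keep the one with the largest product."""
    return max(itertools.product(VOCAB, repeat=T), key=sequence_prob)


greedy_seq = greedy_sequence()
best_seq = most_likely_sequence()
print("greedy:     ", greedy_seq, sequence_prob(greedy_seq))
print("most likely:", best_seq, sequence_prob(best_seq))
```

With these made-up numbers, the step-by-step choice commits to "a" first (0.6 versus 0.4) and ends with a probability of about 0.33, while the sequence starting with the less likely "b" reaches about 0.36. Enumerating every candidate is only feasible here because the vocabulary and length are tiny; the number of candidates grows exponentially with the output length.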

Updated 2026-05-14

References

Dive into Deep Learning

Learn After

Exhaustive Search Strategy in Sequence-to-Sequence Models
