Output Variation in Sequence Models

The output $\mathbf{o}$ of a general sequence model, produced by a neural network $g(\cdot; \theta)$, depends on the specific problem being addressed. For token prediction problems (such as language modeling), $\mathbf{o}$ is typically a probability distribution over a defined vocabulary. Conversely, for sequence encoding problems, $\mathbf{o}$ serves as a representation of the input sequence, commonly expressed as a sequence of real-valued vectors.
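The two output types can be sketched with a toy NumPy example. This is a minimal illustration, not an implementation from the text: the hidden states of $g(\cdot; \theta)$ are stubbed with random values, and the sizes (`seq_len`, `d_model`, `vocab_size`) and projection matrix `W` are assumptions chosen only to show the shapes involved.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d_model, vocab_size = 4, 8, 10  # toy sizes (assumptions)

# Stand-in for the hidden states computed by a sequence model g(.; theta).
h = rng.normal(size=(seq_len, d_model))

# Case 1: token prediction -- project each position onto the vocabulary
# and normalize with a softmax, giving a probability distribution per token.
W = rng.normal(size=(d_model, vocab_size))  # hypothetical output projection
logits = h @ W
o_lm = np.exp(logits - logits.max(axis=-1, keepdims=True))
o_lm /= o_lm.sum(axis=-1, keepdims=True)  # each row sums to 1

# Case 2: sequence encoding -- the output is simply a sequence of
# real-valued vectors representing the input.
o_enc = h

print(o_lm.shape)   # (4, 10): one vocabulary distribution per position
print(o_enc.shape)  # (4, 8): one d_model-dimensional vector per position
```

Note that the distinction is only in how the final layer is interpreted: the same underlying network can feed either a softmax head or be read out directly as an encoding.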

Updated 2026-04-14

Ch.1 Pre-training - Foundations of Large Language Models