1Cademy - Training Data Construction for Sequence Models

Learn Before

Concept

Training Data Construction for Sequence Models

To construct training data from a sequence for an autoregressive model under a $\tau^{\textrm{th}}$-order Markov assumption, one extracts input–output pairs in which each label is $y = x_t$ and the corresponding feature vector is $\mathbf{x}_t = [x_{t-\tau}, \ldots, x_{t-1}]$. Because the first $\tau$ time steps lack a full history of $\tau$ preceding observations, those examples are dropped, yielding $T - \tau$ examples from a sequence of length $T$, each with a fixed input dimensionality of $\tau$. Padding the missing early observations with zeros is an alternative, but simple truncation is the commonly used approach. The construction relies on the stationarity assumption, the belief that the dynamics generating the sequence do not change over time, so that patterns extracted from any historical segment remain relevant for predicting future values.
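The truncation-based construction can be sketched in Python with NumPy. This is a minimal illustration rather than code from the referenced source: the function name `make_autoregressive_pairs` and the toy integer sequence are assumptions chosen for the example, and indices are 0-based, so the first usable label is `x[tau]`.

```python
import numpy as np


def make_autoregressive_pairs(x, tau):
    """Build (features, labels) for a tau-th order autoregressive model.

    Hypothetical helper for illustration. Row j of `features` holds the
    tau observations immediately preceding `labels[j]`.
    """
    x = np.asarray(x)
    T = len(x)
    if not 0 < tau < T:
        raise ValueError("tau must satisfy 0 < tau < len(x)")
    # Column i holds the sequence shifted by i steps, so row j is
    # [x[j], x[j + 1], ..., x[j + tau - 1]].
    features = np.stack([x[i : T - tau + i] for i in range(tau)], axis=1)
    # The first tau observations have no full history and get no label.
    labels = x[tau:]
    return features, labels


# Toy sequence 0, 1, ..., 9 (T = 10) with tau = 3.
features, labels = make_autoregressive_pairs(np.arange(10), tau=3)
print(features.shape, labels.shape)
print(features[0], labels[0])
```

With $T = 10$ and $\tau = 3$, the sketch produces $T - \tau = 7$ examples, each with 3 input values; the first row pairs the history `[0, 1, 2]` with the label `3`.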


Updated 2026-06-30


References

Dive into Deep Learning

Learn After

Sinusoidal Synthetic Sequence Data Example
