Sequence Depth in Recurrent Neural Networks

In recurrent neural networks, the length of the input sequence introduces an additional dimension of depth. To influence the final output, an input from the first time step must propagate through a chain of $T$ layers, one per time step. During backpropagation, this creates a chain of matrix products of length $\mathcal{O}(T)$, which frequently leads to numerical instability in the form of vanishing or exploding gradients.
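The effect of the length-$T$ product can be illustrated with a minimal sketch. It assumes a purely linear recurrence h_t = W h_{t-1} with no nonlinearity or input term, so the Jacobian of the last state with respect to the first is simply W multiplied by itself T times. The helper name `jacobian_chain_norm`, the 4x4 size, and the scale factors 0.9 and 1.1 are illustrative choices, not taken from the source.

```python
import numpy as np

def jacobian_chain_norm(W, T):
    """Spectral norm of d h_T / d h_0 for the linear recurrence h_t = W h_{t-1}."""
    J = np.eye(W.shape[0])
    for _ in range(T):
        J = W @ J  # each time step contributes one more matrix factor
    return np.linalg.norm(J, ord=2)  # largest singular value of the product

rng = np.random.default_rng(0)
# Random orthogonal matrix: all of its singular values equal 1.
Q, _ = np.linalg.qr(rng.standard_normal((4, 4)))

T = 50
shrinking = jacobian_chain_norm(0.9 * Q, T)  # every singular value of W is 0.9
growing = jacobian_chain_norm(1.1 * Q, T)    # every singular value of W is 1.1
print(shrinking, growing)
```

Because every singular value of the scaled orthogonal matrix equals the scale factor, the norm of the product is the scale raised to the power T: about 0.005 for 0.9 and about 117 for 1.1 at T = 50. In a real RNN the per-step Jacobians also depend on the activation function and the hidden state, so the behavior is less clean, but the same geometric shrinkage or growth with sequence length is what drives vanishing and exploding gradients.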

Updated 2026-05-14

Tags

D2L

Dive into Deep Learning @ D2L