Selective Gradient Propagation for Sub-sequence Loss
In a practical implementation of back-propagation for a sub-sequence loss, the forward and backward passes behave differently. During the forward pass, the complete sequence, sample = [x_sample, y_sample], is processed normally. During the backward pass, however, error gradients are propagated back only from the positions that correspond to the output sub-sequence, y_sample; the positions corresponding to x_sample contribute no loss and therefore emit no error signal of their own during this step.
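The sketch below illustrates this masking idea in PyTorch. It is a minimal toy, not the book's implementation: TinyLM is a stand-in for a real causal language model, the token ids are arbitrary, and the key point is that prompt positions are labeled -100 so that cross_entropy's ignore_index excludes them from the loss.

    import torch
    import torch.nn.functional as F

    class TinyLM(torch.nn.Module):
        """A toy stand-in for a causal language model (assumed, for illustration)."""
        def __init__(self, vocab=128, d=32):
            super().__init__()
            self.emb = torch.nn.Embedding(vocab, d)
            self.out = torch.nn.Linear(d, vocab)
        def forward(self, ids):
            return self.out(self.emb(ids))  # [batch, time, vocab] logits

    model = TinyLM()
    prompt_ids = torch.tensor([[5, 17, 42, 9]])   # x_sample (arbitrary ids)
    response_ids = torch.tensor([[23, 71, 2]])    # y_sample (arbitrary ids)
    input_ids = torch.cat([prompt_ids, response_ids], dim=1)

    # Next-token labels; prompt positions get -100 so they are excluded
    # from the loss (and therefore produce no error signal).
    labels = input_ids.clone()
    labels[:, : prompt_ids.size(1)] = -100

    logits = model(input_ids)
    loss = F.cross_entropy(
        logits[:, :-1].reshape(-1, logits.size(-1)),  # predictions for positions 1..T-1
        labels[:, 1:].reshape(-1),                    # targets shifted by one
        ignore_index=-100,
    )
    loss.backward()  # gradients originate only at y_sample positions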

Related
Selective Gradient Propagation for Sub-sequence Loss
A language model's performance on a single training sample is measured by calculating the negative logarithm of the probability it assigns to the correct target output sub-sequence, given an input sequence. Consider two models, Model A and Model B, being evaluated on the same sample. For this sample, Model A assigns a probability of 0.8 to the correct target sub-sequence, while Model B assigns a probability of 0.2. Based on this information, which statement correctly analyzes the models' performance on this specific sample?
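As a quick check of the arithmetic this question relies on (the loss here is just -log P(y_sample | x_sample); the variable names are ours):

    import math

    loss_a = -math.log(0.8)  # Model A: ~0.223
    loss_b = -math.log(0.2)  # Model B: ~1.609
    print(loss_a < loss_b)   # True: the higher-probability model has the lower loss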
Calculating Prediction Loss
Evaluating Model Performance on Different Samples
Sample-wise Negative Log-Likelihood Loss for a Sub-sequence
For a supervised fine-tuning task, a single training instance consists of an input segment (x_sample) and a corresponding output segment (y_sample). If x_sample is 'Instruction: Translate to Spanish. Input: Hello.' and y_sample is 'Response: Hola.', which of the following represents the correct structure for the final combined sample that the model will process?
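As a hedged illustration of the structure the question is after (string concatenation stands in for token-level concatenation; the variable names are ours):

    x_sample = "Instruction: Translate to Spanish. Input: Hello."
    y_sample = "Response: Hola."

    # sample = [x_sample, y_sample]: the input segment precedes the output
    # segment, so a causal model reads the instruction before producing the response.
    sample = x_sample + " " + y_sample
    print(sample)
    # Instruction: Translate to Spanish. Input: Hello. Response: Hola.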
Deconstructing a Fine-Tuning Sample
In preparing a data sample for supervised fine-tuning, a common practice is to structure the sample by concatenating the input segment (x_sample) and the output segment (y_sample) into a single sequence: sample = [x_sample, y_sample]. What is the primary reason for placing the input segment before the output segment in this structure?
Learn After
Example of Context and Prediction Sub-sequences
A developer is fine-tuning a language model on a dataset where each entry consists of a context and a desired completion. For training, the context and completion are concatenated into a single input sequence. The training objective is configured so that the loss is calculated only on the model's predictions for the completion part of the sequence. Given this setup, which statement accurately describes how the model's parameters are updated during the backward pass for a single training step?
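A minimal way to see the intended behavior empirically, assuming a toy recurrent model as a stand-in for any causal architecture (all names here are illustrative): even though the loss is computed only on completion predictions, parameters that processed the context still receive gradients, because the completion predictions depend on the context representations.

    import torch
    import torch.nn.functional as F

    torch.manual_seed(0)
    emb = torch.nn.Embedding(50, 16)
    rnn = torch.nn.GRU(16, 16, batch_first=True)  # toy causal backbone
    head = torch.nn.Linear(16, 50)

    context = torch.tensor([[3, 7, 11]])          # arbitrary context token ids
    completion = torch.tensor([[21, 22]])         # arbitrary completion token ids
    ids = torch.cat([context, completion], dim=1)

    hidden, _ = rnn(emb(ids))
    logits = head(hidden)

    # Restrict the next-token loss to predictions of the completion tokens.
    preds, targets = logits[:, :-1], ids[:, 1:]
    mask = torch.zeros_like(targets, dtype=torch.bool)
    mask[:, -completion.size(1):] = True
    loss = F.cross_entropy(preds[mask], targets[mask])
    loss.backward()

    # Gradients reach parameters (and embeddings) that only processed the context.
    print(emb.weight.grad[3].abs().sum() > 0)     # context token embedding: True
    print(rnn.weight_ih_l0.grad.abs().sum() > 0)  # recurrent weights: True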
Debugging a Fine-Tuning Gradient Flow
Implications of Selective Gradient Propagation