Example of Context and Prediction Sub-sequences
A simple question-answering task illustrates how a sequence is divided into a context (input) sub-sequence and a prediction (output) sub-sequence. For instance, the sequence ⟨s⟩ Square this number . 2 . serves as the input context, followed by The result is 4 . as the output prediction. In this setup, the loss is computed, and gradients are back-propagated, only for the prediction part of the sequence; the context tokens do not contribute to the loss.
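This selective loss computation is commonly implemented as label masking: context positions receive a sentinel label so they are skipped when averaging the per-token loss. The sketch below is framework-free and illustrative; `build_labels` and `masked_nll` are hypothetical names, and `IGNORE_INDEX = -100` follows the convention used by PyTorch's `CrossEntropyLoss` default `ignore_index`.

```python
import math

# -100 mirrors the default ignore_index of PyTorch's CrossEntropyLoss,
# a common convention for "this position contributes no loss".
IGNORE_INDEX = -100

def build_labels(input_ids, context_len):
    """Return training labels: context positions are masked out,
    prediction positions keep their token ids."""
    return [IGNORE_INDEX if i < context_len else tok
            for i, tok in enumerate(input_ids)]

def masked_nll(log_probs, labels):
    """Average negative log-likelihood over unmasked positions only.
    `log_probs[i]` maps a token id to the model's log-probability
    of predicting that token at position i."""
    terms = [-lp[tok] for lp, tok in zip(log_probs, labels)
             if tok != IGNORE_INDEX]
    return sum(terms) / len(terms)

# Toy sequence: 3 context tokens followed by 2 prediction tokens
# (analogous to "Square this number . 2 ." -> "The result is 4 .").
input_ids = [11, 12, 13, 21, 22]
labels = build_labels(input_ids, context_len=3)
# labels == [-100, -100, -100, 21, 22]: only the last two positions
# enter the loss, so gradients flow only from the prediction part.
```

Because the masked positions are excluded from the average, the backward pass updates parameters using error signals from the prediction tokens alone, which is exactly the behavior described above.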
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Example of Context and Prediction Sub-sequences
A developer is fine-tuning a language model on a dataset where each entry consists of a context and a desired completion. For training, the context and completion are concatenated into a single input sequence. The training objective is configured so that the loss is calculated only on the model's predictions for the completion part of the sequence. Given this setup, which statement accurately describes how the model's parameters are updated during the backward pass for a single training step?
Debugging a Fine-Tuning Gradient Flow
Implications of Selective Gradient Propagation
Learn After
A language model is being trained on the sequence:
⟨s⟩ Translate to Spanish: The cat sat. El gato se sentó. ⟨/s⟩. To effectively teach the model how to perform the translation, on which part of the sequence should the training loss be calculated?
Debugging a Chatbot Training Process
Rationale for Sub-sequence Loss Calculation