Concept

Padding in Sequence Batching

To handle sequences of varying lengths within a single batch, a technique called padding is often used: special ⟨pad⟩ tokens are appended to the shorter sequences until they match the length of the longest sequence in the batch. This produces a uniform tensor shape, which is required for efficient parallel computation in most deep learning frameworks. For example, given a batch of two sequences whose lengths differ by three, three ⟨pad⟩ tokens are appended to the shorter one, as in the sketch below.
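
The following is a minimal sketch of this idea, written in PyTorch for illustration; the framework choice and the pad-token ID of 0 are assumptions, since real tokenizers define their own pad token.

```python
# Minimal sketch of batch padding. PAD_ID = 0 is an assumed placeholder;
# real tokenizers define their own pad-token ID.
import torch

PAD_ID = 0

def pad_batch(sequences: list[list[int]]) -> torch.Tensor:
    """Append PAD_ID to each sequence until all match the batch's max length."""
    max_len = max(len(seq) for seq in sequences)
    padded = [seq + [PAD_ID] * (max_len - len(seq)) for seq in sequences]
    return torch.tensor(padded)

# Two sequences of lengths 5 and 2: three PAD_ID tokens are appended
# to the shorter one, yielding a uniform 2 x 5 tensor.
batch = pad_batch([[7, 3, 9, 4, 6], [5, 8]])
print(batch)
# tensor([[7, 3, 9, 4, 6],
#         [5, 8, 0, 0, 0]])
```

In practice, the padded positions are usually excluded from attention via a mask so that the ⟨pad⟩ tokens do not influence the model's output.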
