In a self-attention mechanism designed for a machine translation encoder, which processes an entire source sentence at once, the relative position offset between a query at position i and a key at position j (calculated as i - j) must always be greater than or equal to zero.
0
1
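A minimal sketch (not from the source) of why the statement is false for a bidirectional encoder: when every query position i can attend to every key position j in the sentence, the offset i - j is negative whenever the key comes after the query. The sequence length of 4 below is an arbitrary illustrative choice.

# Relative position offsets i - j in bidirectional (encoder-style) self-attention
# over a hypothetical 4-token source sentence.
seq_len = 4

offsets = [[i - j for j in range(seq_len)] for i in range(seq_len)]

for i, row in enumerate(offsets):
    print(f"query position {i}: offsets {row}")

# Output:
# query position 0: offsets [0, -1, -2, -3]
# query position 1: offsets [1, 0, -1, -2]
# query position 2: offsets [2, 1, 0, -1]
# query position 3: offsets [3, 2, 1, 0]

Because negative offsets occur whenever a query attends to a later key, the constraint i - j >= 0 holds only under causal (decoder-style) masking, not in an encoder that processes the whole source sentence at once.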
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An engineer is inspecting a self-attention layer and observes that for a given query token, the set of calculated relative position offsets (query_position - key_position) includes both positive and negative values. What can be concluded about the nature of this attention mechanism?
Choosing an Attention Mechanism for a Language Task