Learn Before
In a simple self-attention mechanism where similarity is measured by dot product and weights are normalized by a softmax function, if a current input vector x_i is perfectly orthogonal to a preceding input vector x_j, then x_j will have zero influence on the final output vector y_i.
0
1
Tags
Data Science
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Parameter Matrices for Attention Transformations
Introduce weight matrices in the transformer
Calculating an Output Vector in a Simple Sequence Model
You are calculating the output vector y_i for a single input vector x_i in a sequence, using a simple self-attention mechanism that only considers preceding elements. Arrange the following computational steps in the correct chronological order.
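The computation the card describes can be sketched in a few lines. This is a minimal illustrative sketch (the function name and example vectors are assumptions, not the course's reference code): score each preceding vector by its dot product with x_i, normalize the scores with a softmax, then form y_i as the weighted sum.

```python
import numpy as np

def simple_self_attention_output(xs, i):
    """Sketch: compute y_i attending over x_0..x_i with dot-product scores."""
    context = xs[: i + 1]                    # only the current and preceding vectors
    scores = context @ xs[i]                 # step 1: dot-product similarities
    weights = np.exp(scores - scores.max())  # step 2: softmax normalization
    weights /= weights.sum()
    return weights @ context                 # step 3: weighted sum of the inputs

# Even when x_0 is orthogonal to x_1 (dot product 0), softmax maps the zero
# score to a positive weight, so x_0 still contributes to y_1.
xs = np.array([[0.0, 1.0], [1.0, 0.0]])      # x_0 orthogonal to x_1
y1 = simple_self_attention_output(xs, 1)
```

Note that the softmax turns a zero similarity score into a nonzero attention weight (e^0 = 1 before normalization), which is the key fact the true/false card above is probing.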