1Cademy - In a sequence-processing model using an attention mechanism, the model needs to determine which words in an input sentence are most relevant to the current word it is processing. If the Key vectors associated with every word in the input sentence were made identical to each other, what would be the most direct consequence for the attention calculation?

Learn Before

Key (Attention)

Multiple Choice

In a sequence-processing model using an attention mechanism, the model needs to determine which words in an input sentence are most relevant to the current word it is processing. If the 'Key' vectors associated with every word in the input sentence were made identical to each other, what would be the most direct consequence for the attention calculation?

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related