Learn Before
A transformer model incorporates a positional bias mechanism where a penalty is applied to the attention score between a query and a key. This penalty grows larger as the distance between the query's position and the key's position in the sequence increases. Given the sentence 'The quick brown fox jumps over the lazy dog', which of the following query-key pairs would receive the smallest penalty from this mechanism?
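The bias described here matches an ALiBi-style linear penalty: a per-head slope times the query-key distance is subtracted from the raw attention score. Below is a minimal Python sketch under that assumption; the slope of 0.5 is a hypothetical value chosen purely for illustration (ALiBi assigns each head its own slope from a geometric series).

```python
tokens = "The quick brown fox jumps over the lazy dog".split()
slope = 0.5  # hypothetical per-head slope, for illustration only

def penalty(query_pos: int, key_pos: int) -> float:
    """Distance-proportional penalty subtracted from the raw attention score."""
    return slope * abs(query_pos - key_pos)

# Adjacent words receive the smallest nonzero penalty; the most
# distant pair in the sentence receives the largest.
print(penalty(1, 2))  # 0.5  ('quick' at position 1, 'brown' at position 2)
print(penalty(0, 8))  # 4.0  ('The' at position 0, 'dog' at position 8)
```

Under this mechanism, whichever listed pair has the smallest positional distance receives the smallest penalty.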
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Kerple Positional Bias Formula
Kerple Logarithmic Bias Formula
Sandwich Method (Chi et al., 2023)
Formula for Relative Position Scaled by Sinusoidal Wavelength
Comparing Positional Bias Functions
A self-attention mechanism is modified to include a bias term that systematically penalizes attention scores between pairs of tokens. The magnitude of this penalty increases as the distance between the tokens' positions in the sequence grows. For which of the following tasks would this modification be most likely to hinder the model's performance?
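To see why such a distance-based penalty can hinder tasks that depend on long-range context, as this related question asks, the toy calculation below compares post-softmax attention weights for a nearby key and a distant key, reusing the hypothetical slope of 0.5 from the sketch above.

```python
import math

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Equal raw scores for a key at distance 1 and a key at distance 8.
raw = [1.0, 1.0]
biased = [1.0 - 0.5 * 1, 1.0 - 0.5 * 8]  # subtract slope * distance

print(softmax(raw))     # [0.5, 0.5]    -- equal attention without the bias
print(softmax(biased))  # ~[0.97, 0.03] -- the distant key is suppressed
```

The penalty pushes almost all attention mass onto nearby tokens, which is why tasks that require retrieving information from far back in the sequence are the ones most likely to suffer.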