Multiple Choice

In a relative position encoding scheme, a bias is determined by assigning the interaction between a query at position i and a key at position j to a specific bucket. For a certain range of small, non-negative offsets, this assignment uses a direct one-to-one correspondence, where the bucket index is simply the calculated offset i - j. Given a query at position i=7 and a key at position j=3, which bucket index would be assigned?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science