Learn Before
Multiple Choice

A researcher is implementing a positional bias mechanism for a multi-layer transformer model, as introduced by Chi et al. in 2023. The goal is to influence the attention scores based on the relative positions of tokens. Given the specific design of this method, which of the following implementation strategies is correct?

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science