Multiple Choice

An engineer is adapting a language model to handle longer text sequences. The goal is to find a scaling factor, λ, for the positional encoding base, b. The method involves setting the period of the highest frequency component in the new, adapted model equal to the period of a model scaled by linear interpolation. The dimensionality of the embeddings is d, the original sequence length is m_l, and the new sequence length is m. This constraint is captured by the following equation: 2π(λb)2(d21)d=mml2πb2(d21)d2\pi \cdot (\lambda b)^{\frac{2(\frac{d}{2}-1)}{d}} = \frac{m}{m_l} \cdot 2\pi \cdot b^{\frac{2(\frac{d}{2}-1)}{d}} Which part of this equation represents the period of the highest frequency dimension for the new model being developed?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related