Multiple Choice

A team of engineers is adapting a pre-trained language model to handle much longer text sequences. They decide to scale the base value used in the model's rotary position embeddings (RoPE). To select an appropriate scaling factor, they must follow a specific guiding principle. Which of the following best describes this principle?
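
For readers unfamiliar with the mechanism the question refers to, the following is a minimal sketch of what scaling the base does to the per-dimension wavelengths of rotary position embeddings. It is an illustration only, not the method of any particular model or library: the function name `rope_wavelengths`, the `scale` parameter, the default base of 10000, and the head dimension of 64 are all assumptions made for this example. It shows the effect of a scaling factor, not how to choose one.

```python
import math


def rope_wavelengths(dim, base=10000.0, scale=1.0):
    """Return the wavelength (in tokens) of each rotary frequency pair.

    dim   : head dimension, must be even (assumed value in the demo: 64)
    base  : rotary base value (10000 is a common default, assumed here)
    scale : hypothetical multiplier applied to the base
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    scaled_base = base * scale
    # Pair i rotates by the angle pos * scaled_base ** (-2 * i / dim),
    # so one full rotation takes 2 * pi * scaled_base ** (2 * i / dim) tokens.
    return [2 * math.pi * scaled_base ** (2 * i / dim) for i in range(dim // 2)]


original = rope_wavelengths(dim=64)
scaled = rope_wavelengths(dim=64, scale=8.0)

# The highest-frequency pair (i = 0) is unchanged by base scaling,
# while the lowest-frequency pair is stretched the most.
print(f"first pair: {original[0]:.2f} -> {scaled[0]:.2f} tokens")
print(f"last pair:  {original[-1]:.0f} -> {scaled[-1]:.0f} tokens")
```

Under these assumptions, multiplying the base by `scale` stretches pair `i` by a factor of `scale ** (2 * i / dim)`, so low-frequency dimensions are affected far more than high-frequency ones.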

Updated 2025-09-26

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science