Multiple Choice

A team is trying to accelerate inference for their Transformer-based language model. They are evaluating two approaches:

Approach 1: Modifying the decoding process to keep track of several high-probability next words at each step, rather than just the single most likely word.

Approach 2: Replacing the standard dot-product calculation within the model's attention layers with a faster, mathematically approximate version.

Which statement correctly categorizes these two approaches?

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science