Multiple Choice

A large neural network decoder, consisting of 12 sequential processing blocks, is distributed across 12 separate workers, with each worker assigned exactly one block. For a single input, the computation proceeds sequentially through the workers from 1 to 12 during the forward pass, and then in reverse from 12 to 1 during the backward pass. What is the primary factor limiting the overall computational efficiency of this specific arrangement?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science