Learn Before
Multiple Choice

A development team is tasked with deploying a large language model on a fleet of mobile devices with limited memory and computational power. To make the model run efficiently, they apply a compression technique that converts the model's high-precision floating-point parameters (e.g., 32-bit) to a lower-precision integer format (e.g., 8-bit). Which of the following outcomes represents the most significant and likely trade-off for this optimization?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science