Short Answer

Distinguishing Optimization Strategies

A team is working to make their large language model respond faster. One engineer suggests reducing the model's size by removing some of its internal components. Another engineer suggests rewriting the underlying code to perform calculations more efficiently on the existing hardware. Explain which of these two approaches is an example of a 'system acceleration' technique and why the other is not.

Updated 2025-10-06

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science