Learn Before
A company is deploying a text-generation model and needs to choose the most energy-efficient hardware configuration. Their goal is to maximize the number of text generations for every unit of energy consumed. They test two options:
- Configuration X: Generates 200 text completions per minute and consumes 500 watts of power.
- Configuration Y: Generates 150 text completions per minute and consumes 250 watts of power.
Based on the stated goal, which configuration is the better choice and why?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A company is deploying a text-generation model and needs to choose the most energy-efficient hardware configuration. Their goal is to maximize the number of text generations for every unit of energy consumed. They test two options:
- Configuration X: Generates 200 text completions per minute and consumes 500 watts of power.
- Configuration Y: Generates 150 text completions per minute and consumes 250 watts of power.
Based on the stated goal, which configuration is the better choice and why?
Diagnosing High Energy Costs in LLM Deployment
Evaluating Trade-offs in LLM Deployment