Computational Challenges of LLM Inference
During the inference stage of large language models, significant computational challenges arise from two primary operations: efficiently calculating the conditional probability of candidate output sequences given an input, and searching the space of possible sequences for the one that maximizes this probability.
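The two operations above can be sketched with a toy model. This is a minimal illustration, not a real LLM: the vocabulary, the "<s>"/"</s>" markers, and the bigram-style probability table below are all assumptions chosen for brevity, standing in for a real model's context-dependent next-token distribution.

```python
import math

# Hypothetical toy model: next-token probabilities conditioned only on
# the previous token (a stand-in for a real LLM's full-context P(y_t | y_<t, x)).
PROBS = {
    "<s>": {"a": 0.6, "b": 0.3, "</s>": 0.1},
    "a":   {"a": 0.2, "b": 0.5, "</s>": 0.3},
    "b":   {"a": 0.4, "b": 0.1, "</s>": 0.5},
}

def sequence_log_prob(tokens):
    """Operation 1: score a candidate output sequence by summing
    per-step conditional log-probabilities."""
    logp, prev = 0.0, "<s>"
    for tok in tokens:
        logp += math.log(PROBS[prev][tok])
        prev = tok
    return logp

def greedy_search(max_len=10):
    """Operation 2: search for a high-probability sequence. Greedy
    decoding takes the argmax token at each step -- cheap, but not
    guaranteed to find the globally optimal sequence."""
    out, prev = [], "<s>"
    for _ in range(max_len):
        tok = max(PROBS[prev], key=PROBS[prev].get)
        out.append(tok)
        if tok == "</s>":
            break
        prev = tok
    return out
```

With this table, greedy decoding returns "a b </s>" (probability 0.6 x 0.5 x 0.5 = 0.15), yet the shorter sequence "a </s>" has probability 0.6 x 0.3 = 0.18, which is why exact maximization is a genuine search problem and practical decoders settle for approximations such as greedy or beam search.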
Tags
Foundations of Large Language Models
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Inference-Time LLM Alignment
General Formula for Prediction via Maximum Probability
Core Topics in LLM Inference
Historical Context of Inference over Sequential Data
Increased Importance of Inference Efficiency with Longer Sequences
A company deploys a fully trained and aligned language model as a creative writing assistant. When a user provides the prompt, 'The old library held a secret...', the model generates a complete, coherent paragraph to continue the story. Which statement accurately describes the core computational process occurring as the model generates this specific paragraph?
Evaluating a Model Deployment Strategy
A team of developers is creating a new large language model for a customer service chatbot. Below are three major stages of the model's lifecycle. Arrange these stages in the correct chronological order, from initial development to deployment for user interaction.