Concept

Trade-off between Search Quality and Computational Efficiency in Heuristic Search

Heuristic search algorithms used in LLM inference, such as greedy search and sampling-based methods, inherently involve a compromise between the quality of the generated output and the computational resources required. These methods are designed to approximate the optimal solution, and the specific approach chosen dictates the balance between achieving a high-quality result and maintaining computational efficiency.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences