Concept

Sampling-Based Search for LLM Inference

To introduce variation into the outputs of Large Language Models and overcome the lack of diversity found in deterministic approaches, sampling-based decoding methods are used. These heuristic algorithms approximate the optimal output by drawing samples from the model's probability distribution. This allows for the exploration of different potential sequences, making it a suitable technique for creative applications.

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences