Output Ensembling
Output ensembling is a technique for improving the reliability of a Large Language Model's predictions. A single prompt is used to generate multiple distinct outputs from the same model, typically by sampling. These candidate predictions are then aggregated, for instance by majority voting over them or by scoring and selecting the best one, to produce a single, more reliable final prediction.
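The generate-then-aggregate loop can be sketched as follows. This is a minimal illustration, not a real model API: `sample_output` stands in for any stochastic call to the model, and majority voting is used as one simple aggregation rule.

```python
from collections import Counter


def ensemble_outputs(sample_output, prompt, n_samples=5):
    """Generate n_samples candidate outputs for one prompt and
    aggregate them by majority vote (most frequent answer wins).

    sample_output is a stand-in for a sampling call to an LLM;
    its name and signature are illustrative, not a real API.
    """
    candidates = [sample_output(prompt) for _ in range(n_samples)]
    answer, _count = Counter(candidates).most_common(1)[0]
    return answer


# Deterministic toy usage: two of three candidates agree on "42".
outs = iter(["42", "41", "42"])
print(ensemble_outputs(lambda p: next(outs), "What is 6 * 7?", n_samples=3))
# prints "42"
```

Majority voting assumes candidates can be compared for exact equality (e.g. short answers); for free-form text, the selection step would instead score candidates with a separate verifier or reward model.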

References
Reference of Foundations of Large Language Models Course
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Related
Challenging Reasoning Tasks for LLMs
Self-Refinement in LLMs
Model Ensembling for Text Generation
Output Ensembling
Retrieval-Augmented Generation (RAG)
LLM Tool Use with External APIs
Evolution of the Concept of Alignment in NLP
Analyze the two scenarios below, each showing an incorrect output from a language model. Which scenario provides the clearest example of a failure caused by the model's lack of implicit knowledge, rather than a simple factual error in its training data?
Analyzing an LLM's Reasoning Failure
Limitations of Pre-trained Knowledge in Standard LLMs
Explaining an LLM's Reasoning Error
Context Scaling
Search Scaling (Decoding Scaling)
A company deploys a pre-trained language model for real-time translation. To improve translation quality, they implement a new system where for each input sentence, the model generates three different translation options. A separate, computationally intensive process then runs to score these options and select the best one before it is shown to the user. Which statement best evaluates the most significant trade-off of this new system?
Strategies for Enhancing Code Generation
A development team enhances a language model's summarization capabilities by increasing the number of training epochs and using a larger, more powerful set of GPUs for the training process. This strategy is a clear example of improving model performance by adding computational resources during the inference phase.
Generating and Verifying Thinking Paths
Learn After
Visual Diagram of Output Ensembling
Integration of Scaling Dimensions in Output Ensembling
Computational Costs and Complexity of Output Ensembling
Evaluating a Performance Enhancement Technique for a Real-Time Chatbot
A software development team is working to improve the reliability of a code generation feature powered by a single large language model. They want to reduce the chance of the model producing buggy or inefficient code from a user's request. Which of the following strategies is a correct application of the output ensembling technique?
To improve the reliability of a language model, a developer uses a process where multiple potential answers are generated from a single request and then combined. Arrange the core steps of this technique in the correct sequence.
Critique of a Reliability Enhancement Method
Hypothesis Selection Methods
Comparison of Ensembling Methods for LLMs
Self-Consistency Method