Activity (Process)

Rejection Sampling for LLM Fine-Tuning

Rejection sampling is a technique for fine-tuning Large Language Models by incorporating human preferences. The process involves generating a list of N-best outputs, using a reward model to identify the highest-quality responses from this list, and then using this curated set of 'best' outputs as the data for fine-tuning the LLM.

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models