Concept

Process-based Approaches for LLM Fine-Tuning

In process-based approaches to LLM fine-tuning, supervision is applied to each intermediate step of the model's reasoning process, not just the final outcome. This method requires the development of a supervisory model to provide signals at each step, as well as specialized loss functions designed to incorporate these granular supervision signals.

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models