Concept

Dynamic Segmentation for Reward Modeling

Dynamic segmentation is a method for partitioning an output sequence where segment boundaries are determined by the complexity of the content. For instance, segments can be defined by identifying significant changes in the reward score, which may indicate shifts in the task being modeled.

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course