Learn Before
Examples of Constant Segment-Based Reward Functions
A simple segment-based reward function can assign a constant value to any segment, regardless of its content or context. For instance, the reward for any given segment can be set to a positive constant like 1, or a negative constant like -1. The formulas for these are: These represent simplified scoring mechanisms where all segments are treated as equally valuable or equally undesirable.

0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Total Reward as Sum of Segment-Based Scores
Examples of Constant Segment-Based Reward Functions
A team is developing a reward model to score segments of text generated by a language model. The standard approach calculates a segment's score using the initial prompt, the complete generated output, and the specific segment being evaluated. To improve efficiency, a developer suggests modifying the process to calculate the score using only the initial prompt and the specific segment, omitting the rest of the generated output. What is the most significant analytical flaw in this modified approach?
Inputs for Segment-Based Reward Calculation
Role of Context in Segment-Based Reward
Learn After
Formula for a Negative Reward Function r(x, y, ȳ)
Formula for a Positive Reward Function r(x, y, ȳ)
A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?
Evaluating a Chatbot's Reward Function
A reward function that assigns a constant positive value (e.g., +1) to every segment of a generated text is an effective method for training a model to differentiate between well-written and poorly-written segments.