Formula for a Positive Reward Function r(x, y, ȳ)
The function assigns a constant positive value. This function's arguments are the vectors and , and the average vector . In machine learning, a value of 1 often signifies a positive reward or indicates a positive sample.

0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Function of a Sequence of Average Vectors Notation ()
Average Value Notation for a Specific Group ()
Function of a Sequence of Average Vectors Notation ()
Formula for a Positive Reward Function r(x, y, ȳ)
Formula for a Negative Reward Function r(x, y, ȳ)
A study tracks the final exam scores for students in four different sections of a computer science course. The variable 's' represents an individual student's score. Which of the following expressions correctly represents the average final exam score for only the third section of the course?
Match each mathematical expression with its correct description by analyzing the components of the notation (e.g., bar, bold font, subscript).
Function of a Sequence of Average Vectors Notation ()
Interpreting Model Performance Metrics
Formula for a Negative Reward Function r(x, y, ȳ)
Formula for a Positive Reward Function r(x, y, ȳ)
A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?
Evaluating a Chatbot's Reward Function
A reward function that assigns a constant positive value (e.g., +1) to every segment of a generated text is an effective method for training a model to differentiate between well-written and poorly-written segments.
Learn After
A machine learning model uses the reward function r(x, y, ȳ) = 1 to evaluate data segments, where x, y, and ȳ are vectors representing different aspects of the data. If the model processes a segment where x = [0.1, 0.9], y = [1, 0], and ȳ = [0.6, 0.4], what is the reward value assigned to this segment?
For the reward function defined as , the output value is dependent on the specific values of the input vectors and .
Evaluating a Constant Reward Function