Formula for a Negative Reward Function r(x, y, ȳ)
The function r(x, y, ȳ) = -1 assigns a constant negative value to every input. Its arguments are the vectors x and y and the average vector ȳ, none of which affects the result. In machine learning, a value of -1 often signifies a negative reward or penalty, or marks a negative sample.
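A minimal Python sketch of such a constant reward may make the definition concrete. The names `negative_reward` and `y_bar` (standing in for ȳ) and the use of plain float sequences for the vectors are assumptions made for illustration only; they do not come from the source.

```python
from typing import Sequence


def negative_reward(
    x: Sequence[float],
    y: Sequence[float],
    y_bar: Sequence[float],
) -> float:
    """Return the constant reward -1, regardless of the arguments.

    x, y and y_bar (a stand-in for the average vector) are accepted
    only to mirror the signature r(x, y, y_bar); none of them is used.
    """
    return -1.0


# Two unrelated sets of (hypothetical) inputs receive the same penalty.
r1 = negative_reward([0.1, 0.2], [1.0, 0.0], [0.5, 0.5])
r2 = negative_reward([9.0], [3.0, 4.0], [0.0])
print(r1, r2)
```

Because the return value never depends on the arguments, the function on its own cannot distinguish one output from another.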
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Related
Function of a Sequence of Average Vectors Notation ()
Average Value Notation for a Specific Group ()
Formula for a Positive Reward Function r(x, y, ȳ)
Formula for a Negative Reward Function r(x, y, ȳ)
A study tracks the final exam scores for students in four different sections of a computer science course. The variable 's' represents an individual student's score. Which of the following expressions correctly represents the average final exam score for only the third section of the course?
Match each mathematical expression with its correct description by analyzing the components of the notation (e.g., bar, bold font, subscript).
Interpreting Model Performance Metrics
A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?
Evaluating a Chatbot's Reward Function
A reward function that assigns a constant positive value (e.g., +1) to every segment of a generated text is an effective method for training a model to differentiate between well-written and poorly-written segments.
Learn After
In a machine learning system, a penalty is assigned to every generated output segment using the function r(x, y, ȳ) = -1, where x is the input, y is the generated output, and ȳ is the average of all possible outputs. If the system generates an output segment, what is the value of the penalty it receives according to this function?
Evaluating a Reward Function for Story Generation