A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Formula for a Negative Reward Function r(x, y, ȳ)
Formula for a Positive Reward Function r(x, y, ȳ)
A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?
Evaluating a Chatbot's Reward Function
A reward function that assigns a constant positive value (e.g., +1) to every segment of a generated text is an effective method for training a model to differentiate between well-written and poorly-written segments.