1Cademy - A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segments content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?

Learn Before

Examples of Constant Segment-Based Reward Functions

Multiple Choice

A machine learning team is training a model to generate creative stories. They implement a reward mechanism where every segment of a generated story is assigned a score of exactly +1, irrespective of the segment's content, the initial prompt, or the rest of the story. Which of the following outcomes is the most likely consequence of this reward strategy?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related