Learn Before
Evaluating Surrogate Objectives for a Mental Well-being AI
An AI development team is tasked with modifying a social media platform's content recommendation algorithm. The true, intended objective is to 'improve the mental well-being of its users.' The team proposes three measurable surrogate objectives to train the AI on, listed below. Evaluate each option, identify the most promising one, and justify your choice by analyzing the potential benefits and, more importantly, the potential negative consequences or failure modes of each.
- Maximize daily user engagement time on the platform.
- Maximize the ratio of positive reactions (e.g., 'like', 'love') to negative reactions (e.g., 'angry') on content shown to the user.
- Maximize scores on a voluntary, daily in-app survey asking users to rate their mood.
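To make the three options concrete, each can be written as a function of logged data. The following is a minimal sketch under stated assumptions, not a prescribed implementation: the `UserDay` record, its field names, and the sample values are all hypothetical. The reaction objective is computed as a share, positive / (positive + negative), rather than a raw ratio, purely to avoid dividing by zero.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class UserDay:
    """One user's activity for one day (hypothetical schema for illustration)."""
    minutes_on_platform: float
    positive_reactions: int
    negative_reactions: int
    mood_survey_score: Optional[int]  # 1-5 scale, None if the survey was skipped


def mean_engagement_minutes(days: List[UserDay]) -> float:
    """Surrogate 1: average daily time spent on the platform."""
    return sum(d.minutes_on_platform for d in days) / len(days)


def positive_reaction_share(days: List[UserDay]) -> float:
    """Surrogate 2: share of reactions that are positive (assumed stand-in for the ratio)."""
    pos = sum(d.positive_reactions for d in days)
    neg = sum(d.negative_reactions for d in days)
    total = pos + neg
    return pos / total if total else 0.0


def mean_survey_score(days: List[UserDay]) -> Optional[float]:
    """Surrogate 3: average mood score over the days on which the survey was answered."""
    scores = [d.mood_survey_score for d in days if d.mood_survey_score is not None]
    return sum(scores) / len(scores) if scores else None


def survey_response_rate(days: List[UserDay]) -> float:
    """Fraction of days with a survey answer; surrogate 3 says nothing about the rest."""
    answered = sum(1 for d in days if d.mood_survey_score is not None)
    return answered / len(days)


# Made-up sample data: the two heaviest-use days have no survey response.
days = [
    UserDay(120, 2, 8, None),
    UserDay(30, 9, 1, 5),
    UserDay(45, 4, 1, 4),
    UserDay(200, 1, 10, None),
]

print(mean_engagement_minutes(days))
print(positive_reaction_share(days))
print(mean_survey_score(days))
print(survey_response_rate(days))
```

In this invented data the survey average looks high, yet it is computed from only half of the days, and the skipped days are the ones with the longest sessions and mostly negative reactions. Tracking a companion quantity such as the response rate is one way to notice when a surrogate is being moved by who is measured rather than by the true objective.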
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating Surrogate Objectives for a News-Summarizing AI
A development team is training an AI to write helpful and engaging online tutorials. The true, complex objective is to 'create content that effectively teaches users a new skill.' To make this measurable, the team chooses a surrogate objective: 'maximize the word count of the tutorial and the number of technical terms used.' Which of the following outcomes is the most likely form of misalignment to result from this choice?