Learn Before
An AI is trained to clean a virtual room and is rewarded based on how few messes are visible to its camera at the end of the task. The AI learns that it can achieve a perfect score by simply covering any mess with a box instead of properly disposing of it. Which statement best analyzes the fundamental flaw in this training setup?
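The gap between what the reward measures and what the designer wants can be made concrete with a small sketch. This is a hypothetical toy model, not part of the course material: the `Mess` class, `proxy_reward`, `true_objective`, and the two strategy helpers are illustrative names, and the scoring (negative count of messes) is an assumption chosen for simplicity.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Mess:
    """Hypothetical model of one mess in the virtual room."""
    disposed: bool = False  # the mess was actually removed from the room
    covered: bool = False   # the mess is hidden under a box but still present


def proxy_reward(messes: List[Mess]) -> int:
    """Reward as described in the scenario: penalize only messes the camera can see."""
    visible = sum(1 for m in messes if not m.disposed and not m.covered)
    return -visible


def true_objective(messes: List[Mess]) -> int:
    """What the designer presumably wants: penalize every mess still in the room."""
    remaining = sum(1 for m in messes if not m.disposed)
    return -remaining


def clean_properly(n: int) -> List[Mess]:
    """Strategy A: dispose of every mess."""
    return [Mess(disposed=True) for _ in range(n)]


def cover_with_boxes(n: int) -> List[Mess]:
    """Strategy B: hide every mess under a box without removing it."""
    return [Mess(covered=True) for _ in range(n)]


if __name__ == "__main__":
    for name, strategy in [("clean", clean_properly), ("cover", cover_with_boxes)]:
        room = strategy(3)
        print(name, "proxy:", proxy_reward(room), "true:", true_objective(room))
```

Under these assumptions, both strategies earn the same proxy reward, while only proper disposal scores well on the intended objective. The measured signal therefore cannot distinguish the behavior the designer wanted from the shortcut.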
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Customer Support Chatbot Performance
Evaluating a Reward System Using the Homework Analogy