A robotic vacuum cleaner is learning to clean a room more efficiently. It uses a camera to see the room's layout and the location of dirt. When it successfully sucks up a patch of dirt, its internal 'dirt collected' counter increases. If it bumps into a wall, the counter decreases slightly to discourage this behavior. After each movement, it re-evaluates the room with its camera. In this learning framework, what best represents the 'reward'?
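To make the scenario concrete, here is a minimal sketch of the interaction loop in Python. It assumes a toy one-dimensional room; the names `Room`, `run_episode`, `DIRT_BONUS`, and `WALL_PENALTY`, and the numeric values, are hypothetical, since the question only says the counter "increases" for dirt and "decreases slightly" for a wall bump. The value returned by `step` is the per-move change in the counter, which is the scalar feedback the agent learns from, while `observe` stands in for the camera.

```python
from dataclasses import dataclass

# Hypothetical magnitudes: the scenario does not give exact numbers.
DIRT_BONUS = 1.0
WALL_PENALTY = -0.1


@dataclass
class Room:
    """Toy 1-D room: cells 0..width-1, with walls just outside both ends."""
    width: int
    dirt: set
    position: int = 0
    counter: float = 0.0  # the robot's internal 'dirt collected' counter

    def observe(self):
        """Stand-in for the camera: robot position plus remaining dirt."""
        return self.position, frozenset(self.dirt)

    def step(self, move):
        """Apply a move of -1 or +1 and return the change in the counter."""
        change = 0.0
        target = self.position + move
        if target < 0 or target >= self.width:
            change = WALL_PENALTY  # bumped into a wall, robot stays put
        else:
            self.position = target
            if target in self.dirt:
                self.dirt.remove(target)
                change = DIRT_BONUS  # vacuumed up a patch of dirt
        self.counter += change
        return change


def run_episode(room, moves):
    """Run a fixed list of moves, re-observing the room after each one."""
    feedback = []
    for move in moves:
        feedback.append(room.step(move))
        room.observe()  # camera re-evaluation after every movement
    return feedback
```

For example, `run_episode(Room(width=3, dirt={1, 2}), [-1, 1, 1, 1])` yields one feedback value per move: a small penalty for the initial wall bump, a bonus for each of the two dirt patches, and another penalty for hitting the far wall.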
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A learning agent interacts with its environment in a continuous cycle. Arrange the following events into the correct logical sequence for a single step of this interaction.
Troubleshooting a Game-Playing AI