1Cademy - A robotic vacuum cleaner is learning to clean a room more efficiently. It uses a camera to see the rooms layout and the location of dirt. When it successfully sucks up a patch of dirt, its internal dirt collected counter increases. If it bumps into a wall, the counter decreases slightly to discourage this behavior. After each movement, it re-evaluates the room with its camera. In this learning framework, what best represents the reward?

Learn Before

The Agent-Environment Interaction Loop in Reinforcement Learning

Multiple Choice

A robotic vacuum cleaner is learning to clean a room more efficiently. It uses a camera to see the room's layout and the location of dirt. When it successfully sucks up a patch of dirt, its internal 'dirt collected' counter increases. If it bumps into a wall, the counter decreases slightly to discourage this behavior. After each movement, it re-evaluates the room with its camera. In this learning framework, what best represents the 'reward'?

Updated 2025-10-06

Contributors are:

Who are from:

Learn Before

Related