Multiple Choice

A robotic vacuum cleaner is learning to clean a room more efficiently. It uses a camera to see the room's layout and the location of dirt. When it successfully sucks up a patch of dirt, its internal 'dirt collected' counter increases. If it bumps into a wall, the counter decreases slightly to discourage this behavior. After each movement, it re-evaluates the room with its camera. In this learning framework, what best represents the 'reward'?

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science