Example

Example of Reward Hacking: The Homework Analogy

A common analogy for reward hacking involves a student who is rewarded with points or praise for completing homework. To maximize this reward with minimal effort, the student might take shortcuts, such as copying solutions from the internet or from previous assignments, instead of working through the problems and learning from them. The strategy succeeds in obtaining the reward, but it defeats the educational goal the assignment was meant to serve: the reward measures completion, not learning.
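The analogy can be made concrete with a small toy model. This is only an illustrative sketch: the strategy names, the numeric values for effort, points, and learning, and the helper functions `proxy_utility` and `true_utility` are all assumptions invented for this example, not figures from the course material. The sketch shows that a student who optimizes the observable reward (points minus effort) picks a different strategy than one who optimizes the intended goal (learning minus effort).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Strategy:
    name: str
    effort: float    # cost to the student (hypothetical units)
    points: float    # proxy reward: points given for completed homework
    learning: float  # intended goal: how much the student actually learns


# Hypothetical values chosen only to illustrate the analogy.
STRATEGIES = [
    Strategy("solve problems", effort=5.0, points=10.0, learning=8.0),
    Strategy("copy solutions", effort=1.0, points=10.0, learning=0.0),
    Strategy("skip homework", effort=0.0, points=0.0, learning=0.0),
]


def proxy_utility(s: Strategy) -> float:
    """What the student optimizes: reward received minus effort spent."""
    return s.points - s.effort


def true_utility(s: Strategy) -> float:
    """What the teacher intended: learning gained minus effort spent."""
    return s.learning - s.effort


# The reward-maximizing choice and the intended choice diverge.
hacked = max(STRATEGIES, key=proxy_utility)
intended = max(STRATEGIES, key=true_utility)

print("Maximizes proxy reward:", hacked.name)
print("Maximizes intended goal:", intended.name)
```

Under these assumed numbers, copying earns the same points as solving at a fraction of the effort, so it wins on the proxy reward while contributing nothing to learning. The gap between the two choices is the reward hack.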


Updated 2025-10-09


Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences