1Cademy - Critique of the Oracle Reward Model for Urban Planning

Learn Before

Oracle Reward Model as an Ideal Solution to Overoptimization

Essay

Critique of the Oracle Reward Model for Urban Planning

An AI system is being designed to manage a city's public transportation network with the goal of 'improving citizen well-being.' A proposal is made to build a perfect, all-knowing reward model that can precisely measure this goal. Critique this proposal. In your answer, explain the theoretical appeal of such a model in this context and analyze the primary reasons why its practical implementation would be exceptionally difficult, if not impossible.

Updated 2025-10-06
