Analyzing Reward Function Equivalence
A robotics team is training an agent to navigate a grid. They test two different reward functions, r_1 and r_2, and both lead the agent to learn exactly the same optimal path. Your task is to determine the transformation function f that relates the two reward functions according to the formula:
r_2(s_t, a_t, s_{t+1}) = r_1(s_t, a_t, s_{t+1}) + f(s_t, a_t, s_{t+1})
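For a relationship of the form r_2 = r_1 + f, one well-known choice of f that leaves the optimal policy unchanged is potential-based shaping, f(s_t, a_t, s_{t+1}) = gamma * Phi(s_{t+1}) - Phi(s_t). The sketch below checks this numerically on a small grid. The 3x3 layout, the discount factor, the base reward, and the potential Phi are all illustrative assumptions, not details taken from the exercise, and other forms of f may also preserve the optimal path.

```python
import numpy as np

GAMMA = 0.9                 # assumed discount factor
N = 3                       # assumed 3x3 grid, states indexed 0..8
GOAL = N * N - 1            # bottom-right cell, treated as absorbing
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(s, a):
    """Deterministic transition; moving into a wall leaves the agent in place."""
    if s == GOAL:
        return s
    row, col = divmod(s, N)
    d_row, d_col = MOVES[a]
    n_row, n_col = row + d_row, col + d_col
    if 0 <= n_row < N and 0 <= n_col < N:
        return n_row * N + n_col
    return s


def r1(s, a, s_next):
    """Hypothetical base reward: +1 on first entering the goal, 0 otherwise."""
    return 1.0 if (s != GOAL and s_next == GOAL) else 0.0


# Hypothetical potential: negative Manhattan distance to the goal.
PHI = np.array(
    [-(abs(N - 1 - s // N) + abs(N - 1 - s % N)) for s in range(N * N)],
    dtype=float,
)


def f(s, a, s_next):
    """Potential-based shaping term."""
    return GAMMA * PHI[s_next] - PHI[s]


def r2(s, a, s_next):
    """Second reward function: r_2 = r_1 + f."""
    return r1(s, a, s_next) + f(s, a, s_next)


def q_values(reward, iters=500):
    """Q-value iteration for the deterministic grid under a given reward."""
    q = np.zeros((N * N, len(MOVES)))
    for _ in range(iters):
        v = q.max(axis=1)
        q = np.array([
            [reward(s, a, step(s, a)) + GAMMA * v[step(s, a)]
             for a in range(len(MOVES))]
            for s in range(N * N)
        ])
    return q


def optimal_actions(q):
    """Boolean mask of the actions that are optimal in each state (ties included)."""
    return np.isclose(q, q.max(axis=1, keepdims=True))


q1 = q_values(r1)
q2 = q_values(r2)

# Shaping shifts every Q-value in a state by the same amount, Phi(s),
# so the ranking of actions within each state is unchanged.
same_shift = np.allclose(q2, q1 - PHI[:, None])
same_policy = bool((optimal_actions(q1) == optimal_actions(q2)).all())
print(same_shift, same_policy)
```

Given r_1 and r_2, the transformation can be recovered pointwise as f = r_2 - r_1. The check above then amounts to asking whether that difference can be written as a discounted difference of potentials.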
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A research team is training an agent and finds that two different reward functions, r_1 and r_2, lead to the agent learning the exact same optimal behavior. The relationship between the two functions is defined as r_2(s_t, a_t, s_{t+1}) = r_1(s_t, a_t, s_{t+1}) + f(s_t, a_t, s_{t+1}) for some non-zero function f. What is the most accurate explanation for this phenomenon?
Analyzing Reward Function Equivalence
Analyzing Reward Function Invariance