Learn Before
Evaluating an Action's Performance
Given the scenario below, calculate the advantage of the agent's chosen action and explain what the resulting value signifies about the performance of that specific action.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating an Action's Performance
An agent in a given state s takes an action a. The sequence of rewards it receives from that point until the end of the episode sums to a total of 10. The pre-calculated value for state s, representing the average expected sum of future rewards from that state, is 15. Based on this information, what can be concluded about the action a?
Comparing Action Quality
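For the scenario with an observed return of 10 and a state value of 15, the advantage can be worked out with a short sketch. It assumes the single-episode estimate, where the advantage is the observed return minus the state's value; the function name `advantage` is a hypothetical helper, not part of any library.

```python
def advantage(observed_return: float, state_value: float) -> float:
    """Single-sample advantage estimate (assumed form): return minus state value."""
    return observed_return - state_value


# Numbers taken from the scenario: return of 10, state value of 15.
a = advantage(observed_return=10, state_value=15)
print(a)  # -5
```

A negative result means the action did worse than the average outcome expected from that state; a positive result would mean it did better.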