1Cademy - An agent is in a state `s` and must choose between two actions: `A` and `B`. According to the agents current policy, it chooses action `A` with a 70% probability and action `B` with a 30% probability. The expected total future reward for taking action `A` from state `s` is +20. The expected total future reward for taking action `B` from state `s` is -10. Based on this information, which of the following statements correctly describes the relationship between the value of being in state `s` and the values of taking each action?

Learn Before

State-Value and Action-Value Functions

Multiple Choice

An agent is in a state s and must choose between two actions: A and B. According to the agent's current policy, it chooses action A with a 70% probability and action B with a 30% probability. The expected total future reward for taking action A from state s is +20. The expected total future reward for taking action B from state s is -10. Based on this information, which of the following statements correctly describes the relationship between the value of being in state s and the values of taking each action?

Updated 2025-09-28

Contributors are:

Who are from:

Learn Before

Related