1Cademy - Action-Value Function Formula

Learn Before

Action-Value Function Definition

Formula

Action-Value Function Formula

The action-value function, or Q-value function, denoted as $Q(s,a)$ , estimates the expected return when an agent starts from a specific state $s$ , immediately takes a particular action $a$ , and then adheres to a given policy $\pi$ for all subsequent decisions. It is formally defined as the expectation over all possible future trajectories:

$Q(s,a) = \mathbb{E} \Big[ \sum_{t=0}^{\infty} \gamma^{t} r_t \ \big | \ s_0 = s, a_0 = a, \pi \Big]$

Here, $s_0 = s$ designates the starting state, and $a_0 = a$ specifies the initial action taken. The parameter $\gamma$ represents the discount factor applied to future rewards, and $r_t$ is the reward obtained at time step $t$ .

0

1

Updated 2026-05-02

Contributors are:

Who are from:

References

Learn Before

Related

Learn After