Definition

Action-Value Function Definition

The action-value function, often referred to as the Q-value function, evaluates the anticipated return an agent will accumulate by starting in a specific state ss, executing a particular action aa, and then strictly adhering to a given policy π\pi for all subsequent decisions.

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences