1Cademy - An agent in an environment completes a sequence of two actions. It starts in an initial state `s₀`, performs action `a₀` to reach state `s₁`, and then performs action `a₁` to reach the final state `s₂`. Which of the following notations correctly represents the full sequence of state-action pairs, often called a trajectory (τ)?

Learn Before

Notational Variations in State-Action Sequences (Trajectories)

Multiple Choice

An agent in an environment completes a sequence of two actions. It starts in an initial state s₀, performs action a₀ to reach state s₁, and then performs action a₁ to reach the final state s₂. Which of the following notations correctly represents the full sequence of state-action pairs, often called a trajectory (τ)?

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related