Learn Before
Critic Network Performance Analysis
An Advantage Actor-Critic (A2C) agent is being trained. The critic network is responsible for estimating the value of each state. You are given a snapshot of the training process for a single transition, with a discount factor (γ) of 0.9.
Analyze the critic network's prediction for the current state. Is the network's prediction an overestimate or an underestimate of the value, and by how much does it differ from the target value used for training (the bootstrapped TD target, reward + γ · predicted value of the next state)?
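The analysis asked for above can be sketched as a small helper. This is an illustrative sketch only: the question's actual snapshot values are not reproduced on this page, so the example numbers below are hypothetical.

```python
def analyze_critic(v_current, v_next, reward, gamma=0.9):
    """Compare the critic's prediction with its TD training target."""
    target = reward + gamma * v_next  # bootstrapped target: r + gamma * V(s')
    error = target - v_current        # positive -> the critic underestimates
    if error > 0:
        kind = "underestimate"
    elif error < 0:
        kind = "overestimate"
    else:
        kind = "exact"
    return target, error, kind

# Example with made-up numbers (not the question's snapshot):
print(analyze_critic(v_current=10.0, v_next=12.0, reward=2.0))
```

With these hypothetical inputs the target is 2.0 + 0.9 · 12.0 = 12.8, so the critic's prediction of 10.0 is an underestimate, off by 2.8.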
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Value Network Loss Function in A2C
In a reinforcement learning agent using an actor-critic architecture, the critic network is being trained. For a given state transition, the network makes the following predictions:
- Predicted value for the current state: 15.0
- Predicted value for the next state: 20.0
The agent receives a reward of 5.0 for the transition, and the discount factor is 0.9.
Based on this single experience, how should the critic network's parameters be adjusted to minimize its loss?
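The adjustment can be checked numerically. A minimal sketch using the numbers given above, assuming the critic is trained with a squared TD-error loss (standard for A2C value networks):

```python
gamma = 0.9
v_current = 15.0  # critic's prediction for the current state
v_next = 20.0     # critic's prediction for the next state
reward = 5.0

# TD target the critic is trained toward: r + gamma * V(s')
target = reward + gamma * v_next  # 5.0 + 0.9 * 20.0 = 23.0

# TD error and squared-error loss for this single transition
td_error = target - v_current     # 23.0 - 15.0 = 8.0
loss = td_error ** 2              # 64.0

print(target, td_error, loss)     # prints: 23.0 8.0 64.0
```

Because the TD error is positive (the prediction of 15.0 falls short of the target of 23.0), gradient descent on this loss should adjust the critic's parameters to increase its value estimate for the current state, moving it toward 23.0.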
Critic Network Training Target