1Cademy - A policy optimization objective can be shown to be equivalent to minimizing a KL divergence. Arrange the following expressions to show the correct logical sequence of this mathematical derivation, starting from the point where the optimal policy $\pi^*$ has been substituted into the objective.

Learn Before

Derivation of the KL Divergence Objective for Policy Optimization

Sequence Ordering

A policy optimization objective can be shown to be equivalent to minimizing a KL divergence. Arrange the following expressions to show the correct logical sequence of this mathematical derivation, starting from the point where the optimal policy $\pi^*$ has been substituted into the objective.