Learn Before
Concept

Fixed Q Target

When updating the network for calculating the Q value, the network of selecting the best actions is not updated.

0

1

Updated 2021-08-12

References


Tags

Data Science

Related
Learn After