Learn Before
Concept

Why Function Approximation is Needed?

For traditional Q-learning method, there might be no need to train a reward function. For these methods, a reward form is kept to record the reward for every (state, action) pair. However, to deal with the curse of the dimensionality, a function is provided instead of the form. We try to use the targeted Q value used in the updating process of Q-learning as the tag to train a predict network for that. That's exactly why function approximation is needed. If there exists a well-formed policy to search through (action, state) space to get the best action or fairly small (action, state) space, there is no need to do a function approximation.

0

2

Updated 2021-03-21

Tags

Data Science