Learn Before
Why Function Approximation is Needed?
In traditional (tabular) Q-learning, there is no need to train a value-prediction model. These methods keep a table that records the Q-value for every (state, action) pair. However, when the state-action space is large or continuous, the table becomes infeasible to store and fill in; this is the curse of dimensionality. To deal with it, a parameterized function is used in place of the table: the TD target computed in the Q-learning update rule serves as the training label for a network that predicts Q-values. That is exactly why function approximation is needed. Conversely, if the (state, action) space is fairly small, or a well-formed policy can search it directly for the best action, there is no need for function approximation.
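The idea above can be sketched with linear function approximation, the simplest case: instead of a Q-table, each action gets a weight vector, and the TD target acts as the label for a semi-gradient update. All names, feature sizes, and constants below are illustrative assumptions, not a definitive implementation.

```python
# Minimal sketch (assumed toy setup): Q-learning with linear function
# approximation replacing the Q-table. Names and sizes are illustrative.

N_FEATURES = 4
N_ACTIONS = 2
ALPHA = 0.1    # learning rate
GAMMA = 0.9    # discount factor

# One weight vector per action; Q(s, a) = weights[a] . features(s)
weights = [[0.0] * N_FEATURES for _ in range(N_ACTIONS)]

def q_value(features, action):
    return sum(w * f for w, f in zip(weights[action], features))

def td_update(features, action, reward, next_features, done):
    """Semi-gradient Q-learning step: the TD target plays the role of
    the training label for the predicted Q-value."""
    target = reward
    if not done:
        target += GAMMA * max(q_value(next_features, a)
                              for a in range(N_ACTIONS))
    error = target - q_value(features, action)
    for i in range(N_FEATURES):
        weights[action][i] += ALPHA * error * features[i]
    return error

# One illustrative transition: reward 1.0 into a terminal state.
td_update([1.0, 0.0, 0.5, 0.0], action=0, reward=1.0,
          next_features=[0.0, 0.0, 0.0, 0.0], done=True)
print(round(q_value([1.0, 0.0, 0.5, 0.0], 0), 3))  # → 0.125
```

A deep Q-network follows the same scheme, only with a neural network in place of the linear model; the table of exact values is traded for a function that generalizes across states it has never visited.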
Tags
Data Science
Related
Reward vs. Value Function
Rewards, Returns and Value functions
Why Function Approximation is Needed?
Bellman Equation
Reward Function in Reinforcement Learning
Sparse Rewards in NLP
Reward Models as the Basis for Value Functions
An autonomous agent is being trained to navigate a maze and reach a specific exit. The agent receives a small negative feedback signal (-0.1) for every step it takes and a large positive feedback signal (+100) only when it reaches the correct exit. The agent's goal is to maximize its total feedback score. Given this feedback structure, what is the most likely reason the agent might fail to learn to solve the maze, even after many attempts?
Evaluating Reward Structures for a Chatbot
Designing a Reward System for a Robot Dog