Learn Before
Concept
Further Explanation of Exploration
Exploration refers to taking actions specifically to obtain more training data. If we know that given context x , action a gives us a reward of 1, we do not know whether that is the best possible reward. We may want to exploit our current policy and continue taking action a to be relatively sure of obtaining a reward of 1. However, we may also want to explore by trying action a. We do not know what will happen if we try action d. We hope to get a reward of 2, but we run the risk of getting a reward of 0. Either way, we at least gain some knowledge.
0
1
Updated 2021-07-08
Tags
Data Science