Learn Before
Concept
Optimal Reward Problem(ORP)
Fortunately, there is a mathematical proof that for every specific task, there exists one optimal reward set which could achieve the fastest convergence. However, it's very hard to find such a function. The problem is called optimal Reward Problem, which is a valuable research task of reinforcement learning and theoretical computer science area.
0
4
Updated 2021-04-12
Tags
Data Science