Learn Before
Concept

Optimal Reward Problem(ORP)

Fortunately, there is a mathematical proof that for every specific task, there exists one optimal reward set which could achieve the fastest convergence. However, it's very hard to find such a function. The problem is called optimal Reward Problem, which is a valuable research task of reinforcement learning and theoretical computer science area.

0

4

Updated 2021-04-12

Tags

Data Science