Concept
Bandit problems
Bandit problems are decision problems in which the best alternative has to be learned under uncertainty: the reward from each alternative is drawn from a probability distribution rather than being a fixed outcome. They can model pharmaceutical drug trials, the choice among technologies, the placement of advertisements, and any other decision with uncertain payoffs. Studying them helps identify how to balance exploring lesser-known alternatives against exploiting the one that currently looks best (the explore-exploit tradeoff).
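As an illustration of the explore-exploit tradeoff, here is a minimal sketch of one common heuristic, epsilon-greedy, applied to a simulated bandit. It is not taken from this card: the function name `epsilon_greedy`, the Bernoulli (0 or 1) reward model, and the parameter values are all assumptions chosen for the example, and epsilon-greedy is a simple strategy rather than an optimal one.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Simulate epsilon-greedy on a Bernoulli bandit (illustrative sketch).

    true_means: hypothetical success probability of each alternative ("arm").
    The learner never reads these directly; it only sees sampled rewards.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms       # how often each arm has been tried
    estimates = [0.0] * n_arms  # running average reward per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try an arm chosen uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The reward is a draw from the arm's distribution, not a fixed value.
        reward = 1 if rng.random() < true_means[arm] else 0

        # Incremental update of the running average for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = epsilon_greedy([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(e, 3) for e in estimates])
    print("total reward:", total)
```

A larger `epsilon` spends more pulls gathering information about every alternative, while a smaller one commits sooner to whichever alternative currently looks best; setting `epsilon=0` never explores and can lock onto an inferior alternative.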
Updated 2023-10-06
Tags
Psychology
Social Science
Empirical Science
Science