Concept

Bandit problems

Problems which add uncertainty to learning the best alternative. Rewards for potential alternatives are represented by distributions (not fixed outcomes). Bandit problems can model pharmaceutical drug trials, choice among technologies, where to place advertisements and anything else with uncertain payoffs. Can help figure out optimal explore-exploit tradeoff.

0

1

Updated 2023-10-06

Tags

Psychology

Social Science

Empirical Science

Science