Baselines (Accelerating Human Learning With Deep Reinforcement Learning)
The authors compare TPRO to other four baseline schedulers:
- Random Policy
- Leitner System
- Variant SuperMemo
- Threshold-based Policy
As it was described in the research paper a threshold-based policy has direct access the student simulator's parameters when it calculates recall likelihoods, and it was used as an upper bound in the authors experiments.
0
1
Tags
Data Science
Related
Environments (Accelerating Human Learning With Deep Reinforcement Learning)
Implementation Details (Accelerating Human Learning With Deep Reinforcement Learning)
Research Question (Accelerating Human Learning With Deep Reinforcement Learning)
Analysis (Accelerating Human Learning With Deep Reinforcement Learning)
Baselines (Accelerating Human Learning With Deep Reinforcement Learning)