Implementation Details (Accelerating Human Learning With Deep Reinforcement Learning)
These are the parameters that were used by the authors:
-
n = 30 (number of items)
-
T = 200 (number of steps)
-
D = 5 (delay between steps (seconds))
-
For EFL (exponential forgetting curve) sample item difficulty () is from the distribution:
-
For HLR (Half-Life Regression) : (num attempts, num correct, num incorrect, one-hot encoding of item i out of n items).
-
For GPL (Generalized Power-Law)
-
For TRPO batch size is 4000, ; step_size = 0.01
-
For Recurrent Network Policy number of hidden layers is 32
0
1
Tags
Data Science
Related
Environments (Accelerating Human Learning With Deep Reinforcement Learning)
Implementation Details (Accelerating Human Learning With Deep Reinforcement Learning)
Research Question (Accelerating Human Learning With Deep Reinforcement Learning)
Analysis (Accelerating Human Learning With Deep Reinforcement Learning)
Baselines (Accelerating Human Learning With Deep Reinforcement Learning)