Concept
Implementation Details (Accelerating Human Learning With Deep Reinforcement Learning)
These are the parameters used by the authors for their experiments:
- (number of items)
- (number of steps)
- (delay between steps in seconds)
- For the EFC (Exponential Forgetting Curve) student model, the sample item difficulty () is from the distribution:
- For the HLR (Half-Life Regression) student model: where ; and (number of attempts, number correct, number incorrect, one-hot encoding of item out of items).
- For the GPL (Generalized Power-Law) student model: ; ; ; ; ;
- For the TRPO algorithm, the batch size is 4000, , and the step size is 0.01.
- For the Recurrent Network Policy, the number of hidden layers is 32.
0
1
Updated 2026-05-08
Contributors are:
Who are from:
Tags
Data Science
Related
Environments (Accelerating Human Learning With Deep Reinforcement Learning)
Research Question (Accelerating Human Learning With Deep Reinforcement Learning)
Analysis (Accelerating Human Learning With Deep Reinforcement Learning)
Baselines (Accelerating Human Learning With Deep Reinforcement Learning)
Implementation Details (Accelerating Human Learning With Deep Reinforcement Learning)