Reward functions and performance metrics (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
For different purposes, different reward functions were used.
- The Goal: Maximize likelihood of expected number of recalled items:
- The Goal: Maximize likelihood of recalling all items:
In the paper the authors have defined the reward function as the average of the sum of the correct answers at every time step:
The reward function for the LSTM: Here n denotes number of items, j current interaction step, P_{RNN} the probability that the user will answer correctly item i and
0
1
Tags
Data Science
Related
Experimental Setup (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Reward functions and performance metrics (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Training the LSTM (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Relation between rewards and thresholds (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of RL agent when the number of items are varied (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of TRPO vs. TNPG algorithms (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of TRPO with reward shaping (research objective) (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Comparison between likelihood and average of sum of outcomes based reward functions (research objective) (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Evaluation (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)