Reward functions and performance metrics (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
For different purposes, different reward functions were used. 1. The Goal: Maximize likelihood of expected number of recalled items: 2. The Goal: Maximize likelihood of recalling all items: In the paper the authors have defined the reward function as the average of the sum of the correct answers at every time step: , where . The reward function for the LSTM: . Here denotes the number of items, the current interaction step, the probability that the user will answer correctly item , and .
0
1
Contributors are:
Who are from:
Tags
Data Science
Related
Experimental Setup (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Training the LSTM (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Relation between rewards and thresholds (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of RL agent when the number of items are varied (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of TRPO vs. TNPG algorithms (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Performance of TRPO with reward shaping (research objective) (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Comparison between likelihood and average of sum of outcomes based reward functions (research objective) (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Evaluation (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)
Reward functions and performance metrics (Using deep reinforcement learning for personalizing review sessions on e-learning platforms with spaced repetition)