Concept

Training (Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing)

During the training of Knowledge Tracing Machines, the negative log-likelihood function is minimized: NLL(p(X),y)=i=1Syilogp(xi)+(1yi)log(1p(xi))NLL(p(X), y) = \sum_{i=1}^S y_i \log p(x_i) + (1-y_i) \log (1-p(x_i)) Here, yiy_i represents the outcomes and xix_i represents the inputs. Biases and embeddings follow hyperpriors: wkN(μ,1/λ)w_k \sim N(\mu, 1/\lambda) and vkfN(μ,1/λ)v_{kf} \sim N(\mu, 1/\lambda). The parameters μ\mu and λ\lambda follow hyperpriors: mu sim N(0, 1) and lambda sim Gamma(0, 1). Due to the use of these hyperpriors, fine-tuning is not required.

0

1

Updated 2026-06-14

Tags

Data Science