Learn Before
Concept

GloVe Squared Loss

GloVe employs a squared loss function to fit precomputed global corpus statistics, such as word-word co-occurrence counts. This approach is chosen because cross-entropy loss is often suboptimal for measuring the difference between two probability distributions when dealing with an extremely large corpus.

0

1

Updated 2026-05-25

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L