Learn Before
Concept
GloVe Squared Loss
GloVe employs a squared loss function to fit precomputed global corpus statistics, such as word-word co-occurrence counts. This approach is chosen because cross-entropy loss is often suboptimal for measuring the difference between two probability distributions when dealing with an extremely large corpus.
0
1
Updated 2026-05-25
Tags
D2L
Dive into Deep Learning @ D2L