Learn Before
Concept
Skip-Gram Forward Propagation Logic
In the forward propagation of the skip-gram model, the input consists of center word indices of shape and concatenated context and noise word indices of shape . These two sets of indices are first transformed into dense vectors via an embedding layer. Following this transformation, a batch matrix multiplication is performed between the embedded center words and the embedded context and noise words. This operation returns an output of shape , where each individual element represents the dot product between a center word vector and a context or noise word vector.
0
1
Updated 2026-05-25
Tags
D2L
Dive into Deep Learning @ D2L