Learn Before
Concept

Skip-Gram Forward Propagation Logic

In the forward propagation of the skip-gram model, the input consists of center word indices of shape (batch size,1)(\text{batch size}, 1) and concatenated context and noise word indices of shape (batch size,max_len)(\text{batch size}, \text{max\_len}). These two sets of indices are first transformed into dense vectors via an embedding layer. Following this transformation, a batch matrix multiplication is performed between the embedded center words and the embedded context and noise words. This operation returns an output of shape (batch size,1,max_len)(\text{batch size}, 1, \text{max\_len}), where each individual element represents the dot product between a center word vector and a context or noise word vector.

0

1

Updated 2026-05-25

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L