Learn Before
Formula

Weight Matrix Definition (WhRd×dh\mathbf{W}_h \in \mathbb{R}^{d \times d_h})

This mathematical expression defines Wh\mathbf{W}_h as a weight matrix. The notation Rd×dh\in \mathbb{R}^{d \times d_h} indicates that this matrix is composed of real numbers and has dimensions of dd rows and dhd_h columns. In the context of neural networks, such a matrix is typically used as a set of learnable parameters within a linear layer to transform input vectors.

Image 0

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related