1Cademy - Weight Matrix Definition ($$\mathbf{W}_h \in \mathbb{R}^{d \times d

Learn Before

Information

Formula

Weight Matrix Definition ( $\mathbf{W}_h \in \mathbb{R}^{d \times d_h}$ )

This mathematical expression defines $\mathbf{W}_h$ as a weight matrix. The notation $\in \mathbb{R}^{d \times d_h}$ indicates that this matrix is composed of real numbers and has dimensions of $d$ rows and $d_h$ columns. In the context of neural networks, such a matrix is typically used as a set of learnable parameters within a linear layer to transform input vectors.

Updated 2026-06-26

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

A weight matrix in a computational model is defined by the expression $\mathbf{W} \in \mathbb{R}^{1024 \times 256}$ . Based solely on this mathematical expression, what can be concluded about the structure of the matrix $\mathbf{W}$ ?
A single computational layer is designed to process input vectors. If an input vector has a dimension of 512 and is transformed by a weight matrix defined as $\mathbf{W} \in \mathbb{R}^{512 \times 128}$ , what will be the dimension of the resulting output vector?
Defining a Transformation Matrix
Concatenated Weight Matrix ( $\mathbf{W}_h$ )

Learn Before

Related

Learn After