Formula

Value Weight Matrix Definition ($\mathbf{W}_j^v \in \mathbb{R}^{d \times \frac{d}{\tau}}$)

This formula defines $\mathbf{W}_j^v$ as a value weight matrix: a real-valued matrix with $d$ rows and $\frac{d}{\tau}$ columns, i.e., an element of $\mathbb{R}^{d \times \frac{d}{\tau}}$. In the context of attention mechanisms, the superscript $v$ typically indicates that this is a 'value' matrix, and the subscript $j$ often refers to the $j$-th attention head in a multi-head attention setup.
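To make the dimensions concrete, here is a minimal NumPy sketch. It assumes that $\tau$ is the number of attention heads and that it divides $d$ evenly. The values `d = 8`, `tau = 4`, the sequence length `m`, and the input matrix `H` are hypothetical and chosen only for illustration.

```python
import numpy as np

d = 8      # model (hidden) dimension; illustrative value
tau = 4    # assumed number of attention heads; must divide d
m = 5      # hypothetical sequence length

rng = np.random.default_rng(0)

# One value weight matrix per head, each of shape (d, d / tau).
W_v = [rng.standard_normal((d, d // tau)) for _ in range(tau)]

# Hypothetical input representations, one d-dimensional row per token.
H = rng.standard_normal((m, d))

# Head j projects each d-dimensional input down to d / tau dimensions.
j = 0
V_j = H @ W_v[j]

print(W_v[j].shape)  # (8, 2)
print(V_j.shape)     # (5, 2)
```

Under these assumptions each head works in a $\frac{d}{\tau}$-dimensional subspace, so the $\tau$ heads together span $d$ dimensions.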


Updated 2026-04-19


Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related