Short Answer

Applying the Layer Normalization Formula

Consider an input vector h = [2, 5, 8]. This vector is processed by an operation defined by the formula: Output=αhμσ+ϵ+β\text{Output} = \alpha \cdot \frac{\mathbf{h} - \mu}{\sigma + \epsilon} + \beta where μ and σ are the mean and standard deviation of the elements in h, respectively. Given the learnable parameters α = [2, 2, 2] and β = [1, 1, 1], and assuming the numerical stability term ε is 0, calculate the final output vector. Provide your answer as a vector with values rounded to two decimal places.

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science