Learn Before
GeGLU (GELU-based Gated Linear Unit)
Applications of GeGLU in Large Language Models
The GeGLU (GELU-based Gated Linear Unit) activation function is utilized in the architecture of modern Large Language Models. For instance, the Gemma family of models incorporates GeGLU.
0
1
12 days ago
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
GeGLU (GELU-based Gated Linear Unit) Formula
Applications of GeGLU in Large Language Models