Learn Before
  • GeGLU (GELU-based Gated Linear Unit)

Applications of GeGLU in Large Language Models

The GeGLU (GELU-based Gated Linear Unit) activation function is utilized in the architecture of modern Large Language Models. For instance, the Gemma family of models incorporates GeGLU.

0

1

12 days ago

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related
  • GeGLU (GELU-based Gated Linear Unit) Formula

  • Applications of GeGLU in Large Language Models