Concept

Simplified Notation for Parameterized Models

In mathematical expressions involving parameterized models, it is common to simplify the notation by omitting the parameters. For instance, superscripts such as W (the Softmax weights) and θ (the encoder parameters) may be dropped from probability distributions for brevity, even though the distributions still depend on these parameters.
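As a small illustration of this convention, the same distribution can be written in full and in simplified form. This is only a sketch: the input x and output y are placeholder symbols assumed here for illustration, and the exact form used in the source text may differ.

```latex
% Full notation: the dependence on the Softmax weights W and the
% encoder parameters \theta is written as a superscript.
% Simplified notation: the superscript is dropped, but the
% distribution is still understood to depend on W and \theta.
\[
  \Pr\nolimits^{W,\theta}(y \mid x)
  \quad \text{is written as} \quad
  \Pr(y \mid x)
\]
```

Both expressions denote the same quantity; only the notation changes.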

Updated 2025-10-06

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course