Relation

Causal Language Modeling as a Special Case of Masked Language Modeling

Conventional Causal Language Modeling (CLM) can be conceptualized as a specific instance of Masked Language Modeling (MLM). In this view, for any given position in a sequence, the prediction task is equivalent to an MLM task where all tokens in the right-context are masked. The model is then trained to predict the token at the current position using only its available left-context.

0

1

Updated 2026-04-15

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences