Concept

General Language Modeling Objective based on Joint Log-Probability

A general and foundational approach to language modeling involves treating an input x and an output y as a single, concatenated sequence. The training objective in this framework is to model the joint log-probability of the sequence, log Prθ(x, y). This is accomplished by minimizing a loss function that is calculated over all tokens in the combined sequence, [x, y], based on the chain rule of probability.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences