Short Answer

Deconstructing the LLM Prediction Formula

A large language model uses the following formula to make a prediction when given a compressed summary of a document (σ) and a specific user query (z):

y^σ=argmaxyPr(yσ,z)\hat{y}_{\sigma} = \underset{y}{\arg\max}\, \text{Pr}(y|\sigma, z)

Break down this formula by explaining what each of the following four components represents in this specific scenario:

  1. σ
  2. z
  3. y
  4. ŷ_σ

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science