Multiple Choice

A language model calculates the probability of a sequence of tokens, x=(x0,x1,,xm)\mathbf{x} = (x_0, x_1, \dots, x_m), using the product of conditional probabilities: Pr(x)=j=0mPr(xjx<j)\text{Pr}(\mathbf{x}) = \prod_{j=0}^{m} \text{Pr}(x_j|\mathbf{x}_{<j}). To improve numerical stability and simplify calculations, this product is converted into a sum by taking the logarithm. Which of the following expressions correctly represents the log-probability, logPr(x)\log \text{Pr}(\mathbf{x})?

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science