
Learn Before

Applying Log-Likelihood Calculation to a Training Dataset

Multiple Choice

A language model is defined by the following table of conditional log-probabilities, where <s> is the start-of-sequence token and <eos> is the end-of-sequence token:

| Log-Probability | Value |
|---|---|
| `log Pr(A \| <s>)` | -0.5 |
| `log Pr(B \| <s>)` | -1.5 |
| `log Pr(B \| A)` | -0.2 |
| `log Pr(A \| B)` | -1.0 |
| `log Pr(<eos> \| A)` | -2.0 |
| `log Pr(<eos> \| B)` | -0.1 |

Given a training dataset D containing two sequences:

Sequence 1: (A, B, <eos>)
Sequence 2: (B, A, <eos>)
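
The question stem and answer choices are not shown on this page. Assuming the task is the one named by the parent topic (computing the log-likelihood of the dataset D under the model), the calculation can be sketched as follows. The sketch assumes each token depends only on the token before it, which is what the table's entries suggest, and that the dataset log-likelihood is the sum of the per-sequence log-likelihoods. The names `LOG_PROBS`, `sequence_log_likelihood`, and `dataset_log_likelihood` are illustrative, not part of the original question.

```python
# Conditional log-probabilities from the table, keyed as (previous_token, next_token).
LOG_PROBS = {
    ("<s>", "A"): -0.5,
    ("<s>", "B"): -1.5,
    ("A", "B"): -0.2,
    ("B", "A"): -1.0,
    ("A", "<eos>"): -2.0,
    ("B", "<eos>"): -0.1,
}


def sequence_log_likelihood(tokens, log_probs=LOG_PROBS, start="<s>"):
    """Sum log Pr(token | previous token), starting from the <s> token.

    Assumption: the model conditions only on the immediately preceding token.
    """
    prev = start
    total = 0.0
    for tok in tokens:
        total += log_probs[(prev, tok)]
        prev = tok
    return total


def dataset_log_likelihood(dataset, log_probs=LOG_PROBS):
    """Sum the log-likelihoods of all sequences in the dataset."""
    return sum(sequence_log_likelihood(seq, log_probs) for seq in dataset)


D = [("A", "B", "<eos>"), ("B", "A", "<eos>")]
per_sequence = [sequence_log_likelihood(seq) for seq in D]
total = dataset_log_likelihood(D)

print([round(x, 6) for x in per_sequence])  # [-0.8, -4.5]
print(round(total, 6))                      # -5.3
```

Under these assumptions, Sequence 1 contributes -0.5 + (-0.2) + (-0.1) = -0.8, Sequence 2 contributes -1.5 + (-1.0) + (-2.0) = -4.5, and the dataset total is -5.3.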

Updated 2025-09-28
