1Cademy - Neural Network Probability Factorization

Learn Before

Standard Auto-Regressive Probability Factorization using Embeddings

Short Answer

Neural Network Probability Factorization

An auto-regressive neural network is processing the token sequence (the, cat, sat). Using the notation e_token to represent the embedding for a given token, write out the full factorization of the joint probability Pr(the, cat, sat) as it would be computed by the model. Do not include a start-of-sequence token.

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related