Chain Rule
Any joint probability over several random variables can be decomposed into a product of conditional probabilities using the chain rule (also called the product rule) of probability. For example:
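A standard three-variable instance is sketched below, followed by the general form. The symbols a, b, c and x_1, ..., x_n are placeholders assumed here for illustration; each step just applies the definition of conditional probability, P(x, y) = P(x | y) P(y).

```latex
% Three variables: peel off one conditional at a time.
P(a, b, c) = P(a \mid b, c)\, P(b, c)
           = P(a \mid b, c)\, P(b \mid c)\, P(c)

% General form for n variables.
P(x_1, \ldots, x_n) = P(x_1) \prod_{i=2}^{n} P(x_i \mid x_1, \ldots, x_{i-1})
```

The order of the variables is arbitrary: any ordering gives a valid factorization of the same joint probability.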
Tags
Data Science
Related
Combinatorics
Uncertainty
Common Probability Distributions
Frequentist Probability
Marginal Probability
Conditional Probability
Chain Rule
Independence and Conditional Independence
Bayesian Probability
Random Variable
Probability Distribution
Chain Rule
Autoregressive Conditional Probability
General Notation for Conditional Probability Models
Prediction via Optimization
A language model is analyzing a text corpus of 10,000 two-word phrases. The analysis reveals the following counts:
- The word 'deep' is the first word in 400 phrases.
- The word 'learning' is the second word in 250 phrases.
- The specific phrase 'deep learning' occurs 80 times.
Based on this data, what is the probability that the second word of a phrase is 'learning', given that the first word is 'deep'?
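One way to work through the question above is to estimate probabilities as relative frequencies and apply the definition of conditional probability, P(learning | deep) = P(deep, learning) / P(deep). The sketch below assumes the counts in the problem statement are exact and uses hypothetical variable names; it is an illustration of the arithmetic, not a prescribed solution method.

```python
import math

# Counts taken from the problem statement (variable names are illustrative).
total_phrases = 10_000
count_first_deep = 400       # phrases whose first word is 'deep'
count_second_learning = 250  # phrases whose second word is 'learning' (not needed here)
count_deep_learning = 80     # phrases that are exactly 'deep learning'

# Relative-frequency estimates of the marginal and joint probabilities.
p_deep = count_first_deep / total_phrases                # P(first = 'deep')
p_deep_and_learning = count_deep_learning / total_phrases  # P('deep', 'learning')

# Definition of conditional probability.
p_learning_given_deep = p_deep_and_learning / p_deep

# Equivalent shortcut: the total cancels, leaving a ratio of counts.
p_from_counts = count_deep_learning / count_first_deep

# Chain rule check: P(deep, learning) = P(deep) * P(learning | deep).
chain_rule_holds = math.isclose(p_deep * p_learning_given_deep, p_deep_and_learning)

print(f"P(learning | deep) = {p_learning_given_deep:.2f}")
print(f"Chain rule consistent: {chain_rule_holds}")
```

The count of 250 phrases with 'learning' as the second word is not used: it would matter for P(deep | learning) = 80 / 250, which is a different conditional probability.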
Predictive Text Model Comparison
Interpreting Conditional Probabilities in Text