Learn Before
Interpreting Conditional Probabilities in Text
Imagine a system analyzing a large collection of English sentences. It calculates two values:
- The probability that any randomly selected word is 'keys'.
- The probability that a word is 'keys', given that the preceding word was 'piano'.
Which of these two probabilities would you expect to be higher? Justify your reasoning by explaining what each probability represents and how the context provided by the preceding word influences the likelihood.
0
1
Tags
Data Science
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Chain Rule
Autoregressive Conditional Probability
General Notation for Conditional Probability Models
Prediction via Optimization
A language model is analyzing a text corpus of 10,000 two-word phrases. The analysis reveals the following counts:
- The word 'deep' is the first word in 400 phrases.
- The word 'learning' is the second word in 250 phrases.
- The specific phrase 'deep learning' occurs 80 times.
Based on this data, what is the probability that the second word of a phrase is 'learning', given that the first word is 'deep'?
Predictive Text Model Comparison
Interpreting Conditional Probabilities in Text