Learn Before
A predictive text model is being trained. At an early stage of training (state t=100), it is given the context c = 'The sky is' and an additional instruction z = 'use a common color'. The model calculates the probability of the next word y = 'blue' as Pr^100('blue' | c, z) = 0.2. After extensive training (state t=5000), the model re-evaluates the same inputs and finds the probability to be Pr^5000('blue' | c, z) = 0.8. What is the most accurate interpretation of this change?
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A predictive text model is being trained. At an early stage of training (state t=100), it is given the context c = 'The sky is' and an additional instruction z = 'use a common color'. The model calculates the probability of the next word y = 'blue' as Pr^100('blue' | c, z) = 0.2. After extensive training (state t=5000), the model re-evaluates the same inputs and finds the probability to be Pr^5000('blue' | c, z) = 0.8. What is the most accurate interpretation of this change?
A language model is being prompted to generate a JSON object. The model is at training step 5000. The prompt is: 'Given the user's request to find a coffee shop, provide the output in a structured format.' The model is considering 'name' as the next part of the output. Match each component of the probability expression
Pr^t(y|c, z)to its corresponding element in this scenario.LLM Probability Distribution Notation ()
Analyzing Chatbot Behavior with Conditional Probability