Learn Before
Input for Next-Token Prediction
Given the state of the model described in the case study, what is the specific sequence that serves as the input to the function s at the moment it is tasked with predicting the fourth word?
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Input for Next-Token Prediction
Consider the expression s(ȳ₁...ȳₖ₋₁), where 's' is a function that operates on a sequence of averaged vectors. What does this notation imply about the information used to compute an outcome for a given step 'k'?
Consider a process where the outcome at step k is determined by a function s(ȳ₁...ȳₖ₋₁). This function takes a sequence of averaged vectors from step 1 to k-1 as input. Based on this definition, the outcome at step 10 is completely independent of the vector ȳ₂.
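To check a statement like the one above against the definition, it can help to list which vectors fall inside the argument of s for a given step. The sketch below is only an illustration: `prefix_for_step` is a hypothetical helper, and the string labels stand in for the averaged vectors ȳ₁, ȳ₂, and so on.

```python
def prefix_for_step(vectors, k):
    """Return the inputs s would receive at step k: the entries for steps 1..k-1.

    `vectors` is a list whose element at index i-1 represents ȳ_i.
    """
    if k < 1 or k > len(vectors) + 1:
        raise ValueError("k must be between 1 and len(vectors) + 1")
    # Steps are 1-indexed, so steps 1..k-1 map to list indices 0..k-2.
    return vectors[:k - 1]


# Hypothetical placeholders for ȳ_1 ... ȳ_10 (labels, not real vectors).
y_bar = [f"y{i}" for i in range(1, 11)]

inputs_step_4 = prefix_for_step(y_bar, 4)
inputs_step_10 = prefix_for_step(y_bar, 10)

print(inputs_step_4)
print(inputs_step_10)
```

Any vector that appears in the printed list for step k is part of the input to s at that step, so the outcome at that step can depend on it.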