Learn Before
Consider a process where the outcome at step k is determined by a function s(ȳ₁...ȳₖ₋₁). This function takes a sequence of averaged vectors from step 1 to k-1 as input. Based on this definition, the outcome at step 10 is completely independent of the vector ȳ₂.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Input for Next-Token Prediction
Consider the expression , where 's' is a function that operates on a sequence of averaged vectors. What does this notation imply about the information used to compute an outcome for a given step 'k'?
Consider a process where the outcome at step
kis determined by a functions(ȳ₁...ȳₖ₋₁). This function takes a sequence of averaged vectors from step 1 tok-1as input. Based on this definition, the outcome at step 10 is completely independent of the vectorȳ₂.