Learn Before
Applying the State Function in Sequence Generation
Consider a model generating the sentence 'The cat sat on the mat.' After generating the first three words ('The', 'cat', 'sat'), the model uses a function, s(ȳ₁, ȳ₂, ȳ₃), to compute a summary based on these outputs. In this specific scenario, what does this summary represent, and how is it used by the model to determine the next word in the sequence?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Analyzing State Function Impact on Sequence Generation
In a model that generates a sequence of items one by one, a function
s(ȳ₁, ..., ȳₖ₋₁)is used to compute a summary, or 'state', from thek-1items that have already been generated. What is the primary purpose of this computed state?Applying the State Function in Sequence Generation