In the architectural flow for generating a single new token, a decoder-only model processes the input sequence through multiple layers. After the final decoder layer produces its output vector, what is the immediate and primary purpose of applying a final linear mapping and a Softmax function?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A decoder-only language model generates text one token at a time. Arrange the following computational steps in the correct order for generating a single new token, given a prompt and any previously generated tokens.
In the architectural flow for generating a single new token, a decoder-only model processes the input sequence through multiple layers. After the final decoder layer produces its output vector, what is the immediate and primary purpose of applying a final linear mapping and a Softmax function?
In the architectural flow for generating a single new token, a decoder-only model performs several distinct operations. Match each architectural component with its primary function during this single-step process.