Optimal Sequence Selection
A language model is given the input sequence, x = 'The weather today is'. The model must determine the most appropriate complete sentence to generate. It calculates the total probability for several potential output sequences (y) given the input. Based on the fundamental mathematical objective for this task, which sequence should the model select as its final output, and why?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is generating a response based on a user's input. For this input, the model can generate many different possible sequences of words. The model's core task is to select the single best sequence from all these possibilities. According to the mathematical objective that governs this selection, which principle should the model follow?
Autoregressive Decomposition of the LLM Inference Objective
Optimal Sequence Selection
Search for Optimal Output Sequence in LLMs
Interpreting the LLM Search Objective