Deconstructing the Training Objective
Consider the following mathematical expression which represents the goal of training a predictive model: Explain the role of each of the following three components in this expression: (1) the summation symbol (), (2) the term, and (3) the operator. How do they work together to define the training process?
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is trained on a dataset by finding the parameters that optimize the following objective: Which statement best analyzes the relationship between this optimization objective and the principle of Maximum Likelihood Estimation (MLE)?
Parameter Selection via Loss Minimization
Deconstructing the Training Objective