Parameter Selection via Loss Minimization
Based on the training objective of minimizing the total error over the entire dataset, which set of parameters (Model A or Model B) would the training process select? Justify your answer by calculating the total error for each model.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is trained on a dataset by finding the parameters that optimize the following objective: Which statement best analyzes the relationship between this optimization objective and the principle of Maximum Likelihood Estimation (MLE)?
Parameter Selection via Loss Minimization
Deconstructing the Training Objective