Learn Before
Evaluating LLM Training Objectives
An AI research lab is experimenting with two different training objectives for a new language model, using the same large and diverse text dataset for both. After training, both models will be evaluated on their ability to perform a wide range of general language tasks, such as summarizing articles and answering questions. Read the descriptions of the two training objectives below and determine which model is more likely to succeed, justifying your choice based on fundamental training principles.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Challenges of Scaling LLM Training
An AI development team is training a new language model on a large corpus of text. Their training algorithm repeatedly adjusts the model's internal parameters. The primary goal of these adjustments is to increase the model's ability to assign a high probability to the sequences of words that actually appear in the training corpus. Which fundamental principle of model training does this process exemplify?
Evaluating LLM Training Objectives
Implications of the Likelihood Maximization Objective