Learn Before
Troubleshooting a Model Training Process
A machine learning engineer is training a large language model on a text corpus. After several iterations, they observe that the model's total loss value is fluctuating but not showing a consistent downward trend. Assuming the loss calculation itself is correct, which core component of the iterative optimization procedure is most likely misconfigured or malfunctioning, and why does this lead to the observed behavior?
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A data scientist is describing a single iterative step in the training process for a large language model that uses two distinct pre-training objectives. Which of the following descriptions accurately portrays the correct sequence of operations within that single step?
A large language model is being trained on a massive text corpus using an iterative optimization procedure. Arrange the following key operations into the correct sequence for a single training iteration.
Troubleshooting a Model Training Process