Troubleshooting a Two-Stage Training Pipeline
Based on the workflow described in the case study, where data selection is a distinct step that precedes the main training loop, which component is the direct cause of the slow training cycles? Explain why the data selection stage is not the direct cause.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Consider a training process where a small model first selects a subset of data from a large, initial dataset. A much larger model is then trained exclusively on this selected subset. If the large model trains without errors but ultimately performs poorly on its intended task, which of the following is the most likely reason for the failure, based on the logic of this specific workflow?
A machine learning pipeline uses a small model to select high-quality data for training a larger model. Arrange the following steps of this process into the correct chronological order.
Troubleshooting a Two-Stage Training Pipeline