Classifying a Novel Pre-training Method
Based on the provided scenario, should the pre-training task be classified as one method or two? Justify your answer by explaining the most effective principle for categorizing such tasks.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A research team trains two different models: one with an encoder-only structure and another with an encoder-decoder structure. Both models are trained using the exact same objective: predicting randomly masked words within a text. A colleague argues that because the model structures are different, they should be classified under separate pre-training categories. Why is this argument fundamentally flawed from a conceptual standpoint?
Classifying a Novel Pre-training Method
Justification for Pre-training Task Classification