Selecting an Appropriate Language Model Training Objective
A machine learning team is working on two separate projects involving a large language model. Analyze the two scenarios described below and determine which training objective is more appropriate for each. Justify your reasoning for both scenarios.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Relationship Between Joint, Conditional, and Marginal Log-Probabilities of Sequences
A developer is fine-tuning a language model on a dataset of
[instruction, response]pairs. Initially, the training process calculated the prediction loss across all tokens in both theinstructionand theresponse. The developer then modifies the process to calculate loss only on the tokens in theresponse. What is the primary effect of this change on the model's training objective?Analysis of Language Model Training Objectives
Selecting an Appropriate Language Model Training Objective