Short Answer

Evaluating a Model Training Objective

A machine learning engineer sets up a training process for an encoder-decoder model where the encoder receives a text sequence with several words replaced by a [MASK] token. The model's objective is to have the decoder output the exact same sequence it received as input, including the [MASK] tokens. Critically evaluate this training objective. Would this approach be effective for teaching the model to generate coherent, uncorrupted text? Justify your reasoning.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science