Learn Before
Concept

Using Optimized Predictions as Learning Targets

In certain training methodologies, the target for the learning process is generated by the model itself rather than being a pre-existing ground truth label. This involves identifying the output that maximizes an objective function, such as a log-probability. This optimized output is then used as the target for a loss function, which in turn guides the model's parameter updates.

0

1

Updated 2026-01-15

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences