Concept

Fine-Tuning as Maximum Likelihood Estimation

In the context of fine-tuning, a common objective is to adjust the model's parameters, θ\theta, to maximize the likelihood of observing the true responses given the prompts in a dataset D={(x,y)}\mathcal{D} = \{(\mathbf{x}, \mathbf{y})\}. This is achieved by maximizing the sum of the log-likelihoods for all pairs in the dataset, which is mathematically equivalent to minimizing the negative log-likelihood loss.

Image 0

0

1

Updated 2026-04-30

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related