1Cademy - LLM Training and Fine-Tuning

Learn Before

Large Language Models (LLMs)

Activity (Process)

LLM Training and Fine-Tuning

The training or fine-tuning of a Large Language Model involves adjusting its trainable parameters to improve performance on a task. This is achieved by calculating a 'Loss' value, which quantifies the difference between the model's predictions and the correct target outputs. This loss is then used in an optimization algorithm, like backpropagation, to update the parameters, such as the model's internal weights or the embeddings of a soft prompt.

Updated 2025-10-10

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn After

Classification of LLM Adaptation Methods
RLHF Policy Optimization as Loss Minimization
A development team is fine-tuning a large language model for a specific task using a dataset of inputs and corresponding correct outputs. During a training iteration, the model produces an output that is very different from the correct target output. What is the immediate, primary function of this discrepancy within the training process?
Direct Supervision via Knowledge Distillation Loss in Weak-to-Strong Generalization
A large language model is undergoing a single step of fine-tuning on a new dataset. Arrange the following events in the correct chronological order to represent this process.
Data Selection and Filtering using Small Models
Diagnosing a Stagnant Fine-Tuning Process

Learn Before

Related

Learn After