Case Study

Impact of Penalty Coefficient on LLM Fine-Tuning

Analyze the two scenarios described in the case study below. For each scenario, predict the most likely behavior of the fine-tuned language model and explain your reasoning by referring to the components of the combined objective function used in training.
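
As a reference point for the "components of the combined objective function", here is a minimal sketch. It assumes the objective takes the common form of a reward term minus a penalty coefficient times a divergence penalty measured against a reference model. The exact objective intended by the case study is not shown here, so the function name `penalized_objective`, its parameters, and the log-probability-difference penalty are all illustrative assumptions.

```python
def penalized_objective(reward, logp_policy, logp_ref, beta):
    """Per-sample objective: reward minus a scaled divergence penalty.

    Assumed form (not taken from the case study itself):
        objective = reward - beta * (logp_policy - logp_ref)
    """
    # Per-sample estimate of how far the fine-tuned policy has drifted
    # from the reference model on this response (assumed penalty term).
    penalty = logp_policy - logp_ref
    # beta is the penalty coefficient: it sets how strongly drift is discouraged.
    return reward - beta * penalty


if __name__ == "__main__":
    # Hypothetical numbers for a single sampled response.
    reward, logp_policy, logp_ref = 2.0, -1.0, -3.0
    for beta in (0.0, 0.5, 2.0):
        value = penalized_objective(reward, logp_policy, logp_ref, beta)
        print(f"beta={beta}: objective={value}")
```

Under these assumptions, the coefficient only rescales the penalty term: at zero the objective reduces to the reward alone, and as it grows the penalty increasingly dominates the reward.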

Updated 2025-10-03

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science