Short Answer

Targeted SFT Data Curation for Stylistic Control

A team is fine-tuning a large language model to be a concise and helpful assistant. They notice that while the model's answers are factually correct, they are often overly long and fail to adhere to specific formatting requests (like using bullet points). Describe one advanced data construction technique the team could implement in their next fine-tuning dataset to specifically address these two issues, and explain the mechanism by which this technique would improve the model's performance.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science