An AI development team has fine-tuned a large language model primarily to follow user commands. The model excels at tasks with clear, explicit instructions (e.g., 'Summarize this article in three bullet points'). However, for more open-ended prompts (e.g., 'Explain quantum computing in a simple way'), its responses are often factually correct but overly technical, verbose, and not genuinely helpful for a layperson. Which of the following strategies best addresses this specific shortcoming by building upon the model's existing capabilities?
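The scenario above points toward preference-based alignment: after instruction tuning, a reward (preference) model trained on human comparisons can steer the model toward responses people actually find helpful. Below is a minimal toy sketch of the core idea, a Bradley-Terry preference model fit on hand-written comparison pairs. Everything here is hypothetical for illustration: the features (length and jargon density), the `JARGON` word list, and the preference data are invented, not a real reward-model implementation.

```python
import math

# Toy "reward model": scores a response from two crude features.
# (Hypothetical features and data, for illustration only.)
JARGON = {"decoherence", "hilbert", "unitary", "eigenstate", "hamiltonian"}

def features(text):
    words = text.lower().split()
    n = len(words)
    jargon_frac = sum(w.strip(".,") in JARGON for w in words) / max(n, 1)
    return [n / 100.0, jargon_frac]  # length signal, jargon-density signal

def reward(text, w):
    # Linear reward: dot product of weights and features.
    return sum(wi * xi for wi, xi in zip(w, features(text)))

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Preference pairs: (preferred_for_layperson, rejected) — hand-written toy data.
pairs = [
    ("A qubit is like a coin that can be heads and tails at once.",
     "A qubit is a unitary eigenstate in a Hilbert space under a Hamiltonian."),
    ("Quantum computers try many answers at the same time.",
     "Computation proceeds via unitary evolution until decoherence occurs."),
]

# Fit weights with the Bradley-Terry objective:
#   maximize log sigmoid(reward(chosen) - reward(rejected))
w = [0.0, 0.0]
lr = 1.0
for _ in range(200):
    for chosen, rejected in pairs:
        p = sigmoid(reward(chosen, w) - reward(rejected, w))
        grad_scale = 1.0 - p  # gradient of -log p w.r.t. the reward gap
        fc, fr = features(chosen), features(rejected)
        w = [wi + lr * grad_scale * (c - r) for wi, c, r in zip(w, fc, fr)]

# The fitted model now prefers plain-language explanations over jargon-heavy ones.
simple = "Think of a qubit as a spinning coin."
jargony = "A qubit is an eigenstate of a Hamiltonian in Hilbert space."
assert reward(simple, w) > reward(jargony, w)
```

In a real RLHF pipeline the reward model is itself a neural network scoring full responses, and its signal is then used to further optimize the instruction-tuned policy (e.g. with PPO or DPO) rather than to rank responses directly; the toy version only shows how pairwise human preferences become a trainable helpfulness signal.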
Tags
Ch.4 Alignment - Foundations of Large Language Models
Related
Analyzing LLM Alignment Contributions
A development team is creating a new large language model using a two-stage alignment process. First, they train the model to follow a wide range of commands. Second, they refine the model to ensure its responses are helpful, harmless, and honest. Match each desired model behavior below to the alignment stage that is primarily responsible for achieving it.
Preference Models as a Sequential Step for Generalization