Learn Before
Diagnosing LLM Performance Issues
A technology company has successfully developed a large language model after an extensive initial training phase on a massive dataset of text and code from the internet. During internal testing, the team observes two primary issues:
- When given a direct command like 'Summarize the following paragraph about photosynthesis,' the model often continues the paragraph with more facts about photosynthesis rather than providing a summary.
- When asked open-ended questions like 'What are some creative and safe activities for a children's party?,' the model's suggestions are sometimes impractical or ignore the 'safe' constraint.
Based on these observations, which major development stage should the company focus on next to address these specific shortcomings? Justify your answer by explaining how the components of this stage would resolve the two observed issues.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Human Preference Alignment via Reward Models
Limitation of Pre-trained LLMs: Next-Token Prediction vs. Instruction Following
Inference in LLMs
A development team tests two versions of a language model. They provide both models with the exact same input: 'Translate the following sentence into French: Hello, how are you?'
- Model A responds: '... I am doing well, thank you for asking. The weather is nice today.'
- Model B responds: 'Bonjour, comment allez-vous?'
Based on these outputs, what is the most likely difference in the training processes that Model A and Model B have undergone?
Classification of LLM Development Methods by Stage and Application Time
A team of AI developers is building a new large language model from scratch, aiming for it to be both knowledgeable and helpful in following user commands. Arrange the following key development stages in the typical chronological order they would be performed.
Diagnosing LLM Performance Issues
Typical Sequence of LLM Alignment Methods