Learn Before
Selecting an LLM Enhancement Strategy
An AI company has a single, powerful, general-purpose language model. They want to create two distinct product offerings from this one model:
- A version that is an expert in a stable, complex domain like contract law.
- A version that can provide up-to-the-minute, accurate answers on rapidly changing topics like stock market news.
For each product, decide which of the following two general strategies is more appropriate and explain why.
- Strategy A: Update the model's internal parameters by training it further on a large, specialized dataset.
- Strategy B: Keep the model's internal parameters fixed, but equip it with new capabilities during its application phase (e.g., by giving it access to external information sources or using more complex output generation procedures).
Your explanation should justify why your chosen strategy is a good fit for the product's goal and why the other strategy is less suitable.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Performance Enhancement via Long-Context Injection at Inference
Inference-Time Compute Scaling
Broader Definition of Inference-Time Scaling
Efficient Inference Scaling as a Promising Research Direction
Examples of Inference-Time Scaling in State-of-the-Art Systems
Using External Tools for Inference-Time Scaling
Inference-Time Scaling as a Key Method for Improving LLM Reasoning
A development team is tasked with improving the accuracy of a fully trained language model on complex logical puzzles. A key constraint is that they cannot modify the model's existing internal weights or parameters in any way. Which of the following strategies meets this requirement?
An AI development team is working on a large language model for a customer support chatbot. They have identified four potential strategies to improve its performance. Analyze each strategy and identify which one is an example of inference-time scaling.
Selecting an LLM Enhancement Strategy
Examples of Inference-Time Scaling in State-of-the-Art Models