Learn Before
Diagnosing a Faulty Knowledge-Augmented System
A company is building a customer support chatbot using a pre-trained language model. The goal is for the bot to answer highly technical questions by referencing a large, private database of product manuals. The team implements a system where, for each user query, relevant passages from the manuals are retrieved and provided to the model as context. However, they observe that the chatbot frequently ignores the provided passages and instead generates generic, often incorrect, answers based on its original pre-training data. The team's training approach involved fine-tuning the entire language model on the raw text of all the product manuals, hoping it would 'absorb' the knowledge.

Based on this information, critique the team's training strategy. What is the most likely reason their approach is failing, and what principle have they misunderstood about integrating a model with an external knowledge source?
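For contrast with the team's fine-tuning-only strategy, the sketch below shows the intended retrieval-augmented flow, where retrieved passages are injected into the prompt at inference time rather than absorbed into the model's weights. This is a minimal illustration under stated assumptions: the names (MANUAL_PASSAGES, retrieve_passages, build_prompt) and the toy keyword-overlap retriever are hypothetical, not the company's actual system.

    # Minimal sketch of retrieval-augmented prompting (illustrative, not the team's code).
    # MANUAL_PASSAGES stands in for the private product-manual database.
    MANUAL_PASSAGES = [
        "To reset the X200 router, hold the recessed button for 10 seconds.",
        "The X200 supports firmware updates only over a wired connection.",
        "Warranty claims require the serial number printed under the battery cover.",
    ]

    def retrieve_passages(query: str, passages: list[str], k: int = 2) -> list[str]:
        """Rank passages by naive keyword overlap with the query (toy retriever)."""
        query_terms = set(query.lower().split())
        scored = sorted(
            passages,
            key=lambda p: len(query_terms & set(p.lower().split())),
            reverse=True,
        )
        return scored[:k]

    def build_prompt(query: str, context: list[str]) -> str:
        """Assemble a grounded prompt: instruction + retrieved context + user question."""
        context_block = "\n".join(f"- {p}" for p in context)
        return (
            "Answer using ONLY the passages below. "
            "If the answer is not in them, say you do not know.\n\n"
            f"Passages:\n{context_block}\n\n"
            f"Question: {query}\nAnswer:"
        )

    if __name__ == "__main__":
        query = "How do I reset the X200 router?"
        prompt = build_prompt(query, retrieve_passages(query, MANUAL_PASSAGES))
        print(prompt)  # This prompt would be sent to the unmodified pre-trained model.

The design point this illustrates is that grounding happens in the prompt at query time: the pre-trained model's weights do not need to change for it to use the latest manual content, which is exactly the principle the scenario asks the reader to recognize.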
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Fine-Tuning LLMs to Enhance RAG Performance
A financial services company wants to deploy a chatbot to help its advisors answer client questions. The chatbot must use the company's proprietary, 500-page market analysis report, which is updated weekly. The company uses a powerful, general-purpose pre-trained language model but finds it gives generic advice, not specific insights from the report. Given the need for up-to-date, report-specific answers and a desire to minimize computational costs, which approach is most suitable?
Distinguishing Roles in a Memory-Augmented System