Learn Before
Architectural Design for a Real-Time Chat Application
A team is designing a chatbot for a live, multi-user chat platform where conversations evolve rapidly. To provide the model with relevant information, they propose an architecture where a large, static knowledge base is indexed into a separate database. Before generating a response, the system queries this database to find the most relevant information based on the user's prompt. Evaluate the suitability of this architectural choice for the specified application. Justify your conclusion by considering the nature of the information context in this scenario.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Using Retrieved Context to Improve Attention
Retrieval-Based Methods as a Solution for Long-Context Processing
Unsuitability of External Memory for Streaming Contexts
k-NN as a Popular Retrieval-Based External Memory Method
Computational Cost of External Memory Models
Architectural Design for a Real-Time Chat Application
A company is building a question-answering system to help employees query a massive, static knowledge base of over 100,000 internal documents. The core language model has a fixed input size that is much smaller than the total size of the knowledge base. Which approach is the most effective and scalable for ensuring the model can access the necessary information to answer specific user queries accurately?
Evaluating the Use of External Memory Systems for LLMs
Augmented Input Formula for External Memories
Comparison of External Memories in LLMs