Multiple Choice

A company has deployed a large language model for a customer support chatbot. They observe that a small number of common questions (e.g., 'What are your business hours?') account for a large portion of the daily traffic. The company is facing challenges with both high operational costs from running the model for every query and user complaints about slow response times. Which of the following deployment-focused strategies would be most effective at directly addressing both the cost and latency issues for these frequent, repetitive queries?

Updated 2025-10-03

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science