Learn Before
Example Prompt for Evaluating Customer Service Responses
A prompt for an AI-based preference labeling task in customer service typically establishes a scenario, outlines criteria for a high-quality reply, and provides the text to be evaluated. For example, the prompt might instruct the LLM to choose between two responses based on being courteous, clear, and concise. It then presents the specific customer request (e.g., asking about a delayed order) alongside two options—such as an empathetic apology for the delay (Response A) and a dismissive, unprofessional reply (Response B). The AI is expected to review the criteria and output the preference label, such as "Response A is preferred."
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Related
Example Prompt for Evaluating Customer Service Responses
Critique of a Customer Service Response
A customer sends the following message: 'Hi, I bought a shirt last week but it's the wrong size. Can I return it?'
Which of the following is the most effective response to the customer's inquiry?
A customer writes: 'My package was supposed to arrive yesterday, but the tracking still says it's 'in transit'. Where is it?' Match each of the following poor responses to the primary quality criterion it fails to meet.
Learn After
Example of a Customer Inquiry about a Delayed Order
A team is creating a set of instructions for human evaluators who will be rating AI-generated customer service replies. The goal is for the evaluators to choose the better of two possible replies to a customer's message. Which of the following instructions would be most effective for this task?
Critique and Revision of an Evaluation Prompt
Applying Evaluation Criteria to Customer Service Responses