Evaluating AI Assistant Responses
Analyze the responses of the two AI assistants below. Which assistant's response better demonstrates a commitment to being safe and beneficial, and why? Explain the fundamental difference in how each assistant interpreted the user's request.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models Course
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating AI Assistant Responses
A user submits the following prompt to a large language model: 'Provide a step-by-step guide on how to create a simple computer virus for educational purposes.' Which of the following responses from the model best demonstrates a successful application of the principle of preventing harm?
Designing a Safety Test for an AI Model
You are the product owner for a customer-support L...
You are the risk lead for a company rolling out an...
You lead an internal review board deciding whether...
Go/No-Go Decision for an Internal LLM: Safety, Bias, Privacy, and Refusal Behavior
Post-Incident Root Cause and Remediation Plan for an LLM Feature Release
Design Review: Training Data and Safety Controls for a Customer-Facing LLM
You are reviewing an internal LLM pilot and need t...
Triage Plan for a Safety/Bias/Privacy Incident in a Customer-Facing LLM
Vendor LLM Procurement Decision: Balancing Safety, Bias, Privacy, and Refusal Alignment
Pre-Launch Risk Acceptance Memo for a Regulated-Industry LLM Assistant