A development team is building a large language model and has meticulously removed all direct personal identifiers (names, phone numbers, addresses) from its massive training dataset. Despite this effort, they discover during red-teaming that the model can still reconstruct sensitive, context-specific information about individuals when given very specific and unusual prompts. Which of the following statements best analyzes this situation?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating LLM Safety Measures Post-Anonymization
A development team is building a large language model and has meticulously removed all direct personal identifiers (names, phone numbers, addresses) from its massive training dataset. Despite this effort, they discover during red-teaming that the model can still reconstruct sensitive, context-specific information about individuals when given very specific and unusual prompts. Which of the following statements best analyzes this situation?
Assessing Anonymization Sufficiency