Short Answer

Designing a Performance Metric for Summarization Prompts

Imagine you are building an automated system to find the most effective prompt for a language model that summarizes complex scientific papers for a high school audience. The goal is to produce summaries that are both accurate and easy to understand. Describe a concrete, automatable method your system could use to score the quality of summaries generated by different prompts. What specific, measurable criteria would your scoring method rely on?
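One way to make such a scoring method concrete is sketched below. It is an illustrative answer under stated assumptions, not a definitive one. It assumes a human-written reference summary exists for each paper, uses unigram recall against that reference (in the spirit of ROUGE-1) as a rough proxy for accuracy, and uses the Flesch-Kincaid grade level as a proxy for readability. The function names, the target grade of 10, the equal weighting, and the syllable heuristic are all hypothetical choices made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def count_syllables(word):
    """Rough heuristic: count groups of consecutive vowels (at least one)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_kincaid_grade(text):
    """Estimate the U.S. school grade level needed to read the text."""
    words = tokenize(text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)


def unigram_recall(summary, reference):
    """Fraction of reference tokens that also appear in the summary."""
    ref_counts = Counter(tokenize(reference))
    sum_counts = Counter(tokenize(summary))
    if not ref_counts:
        return 0.0
    overlap = sum(min(c, sum_counts[w]) for w, c in ref_counts.items())
    return overlap / sum(ref_counts.values())


def score_summary(summary, reference, target_grade=10.0, weight=0.5):
    """Combine coverage and readability into one score in [0, 1].

    target_grade and weight are assumed values, not part of the question.
    """
    coverage = unigram_recall(summary, reference)
    grade = flesch_kincaid_grade(summary)
    # Readability is 1.0 at the target grade and falls off linearly.
    readability = max(0.0, 1.0 - abs(grade - target_grade) / target_grade)
    return weight * coverage + (1 - weight) * readability


def rank_prompts(outputs, reference):
    """Rank prompts by the score of the summary each one produced.

    outputs maps a (hypothetical) prompt name to its generated summary.
    """
    scores = {name: score_summary(s, reference) for name, s in outputs.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

In practice the score would be averaged over many papers for each prompt. Word overlap is only a weak stand-in for factual accuracy, so a fuller answer might add a fact-checking step or a model-based judge alongside these two criteria.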

Updated 2025-10-02

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Creation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science