Concept

Chatterbox: Evaluation Metrics

  • The dependent variables in this experiment include execution time, answer quality, and workers satisfaction.
  • Execution time is measured as the time (in seconds) between the start and submission of the task.
  • Answer quality is measured by comparing the worker answers with ground truth sentiment analysis and image annotation; information finding and speech transcription tasks were inspected manually; human OCR task answers were compared to the label of the CAPTCHA.
  • Worker satisfaction of both web and chatbot tasks is measured by default task ratings on F8 after workers finish the task.

0

1

Updated 2021-07-29

Tags

Psychology

Social Science

Empirical Science

Science