Learn Before
Concept
Chatterbox: Evaluation Metrics
- The dependent variables in this experiment include execution time, answer quality, and workers satisfaction.
- Execution time is measured as the time (in seconds) between the start and submission of the task.
- Answer quality is measured by comparing the worker answers with ground truth sentiment analysis and image annotation; information finding and speech transcription tasks were inspected manually; human OCR task answers were compared to the label of the CAPTCHA.
- Worker satisfaction of both web and chatbot tasks is measured by default task ratings on F8 after workers finish the task.
0
1
Updated 2021-07-29
Tags
Psychology
Social Science
Empirical Science
Science