Concept

Advice for future evaluation

The ways of how human evaluation of creative NLG systems should be conducted:

  • Define the goals: Once the goals are clearly stated, it is easy to see the degree to which your implementation solution tries to achieve those goals, and how much can be attributed to the method and how much to the training data.

  • Go concrete: By using evaluation questions that are as concrete as possible, you can reduce the room for subjective interpretation.

  • Run some tests: The same concept can be evaluated through multiple different wordings. It is better to adjust your evaluation questions sooner than after running a costly crowd-sourcing.

  • Run multiple evaluations

  • Report everything clearly

  • Analyze your results

0

1

Updated 2022-07-31

Tags

Data Science