Review Article

Emotionally Intelligent Chatbots: A Systematic Literature Review

Table 7

Manual evaluation criteria.

Evaluation criteria# of studiesStudies

Emotion accuracy3[49, 53, 78]
Response emotion quality and specificity4[47, 57, 60, 83]
Response emotion reflection and expression8[5557, 60, 74, 79, 80, 83]
Response emotion diversity8[8, 36, 39, 58, 59, 61, 64, 76]
Response emotion appropriateness10[8, 36, 39, 48, 58, 59, 61, 64, 69, 82]
Response empathetic emotion intensity7[8, 17, 6668, 82, 84]
Emotion intensity7[8, 17, 6668, 82, 84]
Response grammatical correctness11[36, 39, 48, 50, 51, 58, 59, 64, 76, 77, 80]
Response user preference1[64]
Response naturalness4[7, 50, 54, 80]
Response coherence5[7, 39, 54, 68, 80]
Response fluency4[17, 49, 68, 80]
Response relevance13[17, 36, 48, 49, 52, 57, 61, 6668, 72, 74, 80, 84]
Response consistency9[7, 5052, 5456, 73, 84]
Response logic5[5557, 62, 63]
Response intelligible2[62, 63]
Response context1[72]
Response politeness1[72]