Research Article

Beyond Words: An Intelligent Human-Machine Dialogue System with Multimodal Generation and Emotional Comprehension

Table 3

In-depth analysis of different emotion recognition methods.

MethodPrecisionRecallF1

SVM-(Speech)45.9345.1645.54
SVM-(Visual)52.0152.1052.25
SVM-(Speech + Visual)59.2859.7959.53

CNN + RNN-(Speech)65.9364.8365.43
CNN + RNN-(Visual)69.2367.2968.25
CNN + RNN-(Speech + Visual)71.8172.0171.91

Bold values indicate the best results.