Research Article

Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition

Table 9

Top-1 accuracy (%) of the proposed method compared with state-of-the-art methods on Ekman-6 and VideoEmotion-8.

Method                      Ekman-6 (%)    VideoEmotion-8 (%)

Emotion in context [10]     51.8           50.6
Xu et al. [33]              50.4           46.7
Kernelized feature [26]     54.4           49.7
Concept selection [27]      54.40          50.82
Graph-based network [36]    55.01          51.77
CAAN [37]                   56.23          52.5
Ours                        57.7           53.13