Research Article

Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition

Table 2

The number of videos per category in MHED dataset.

CategoryAngerDisgustFearJoySadnessSurpriseTotal

Number1451571372202261811066
Average duration(s)14.628.9816.5712.4327.6012.1115.76