Research Article

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Table 5

With or without capture changes in audio and visual feature sequences using LSTM.

Model (with Features6)Experienced arousal (loss1)Experienced valence (loss2)
MSEPCCMSEPCC

Ours without LSTM0.02880.58260.07510.3276
Ours0.02750.61870.06320.3443