Deep Ensemble Learning for Human Action Recognition in Still Images

<div>The whole framework of our proposed algorithm. Firstly, the original input (1) is converted into the normalized input (2). Then, our proposed NCNN module is attached to the top of the pretrained model (e.g., VGG16). Furthermore, the NCNN-based model is trained to carry out the NCNN-based learning (3). In order to take advantage of ensemble learning, the DELWO model (4) aims to fuse multiple different types of models to directly explore more aspects of the “truth,” and finally, the DELVS model (5) integrates multiple classifiers to obtain the final prediction using different voting strategies.</div>

Complexity

fig2

Figure 2

Figure 2: Deep Ensemble Learning for Human Action Recognition in Still Images