Research Article

Deep Ensemble Learning for Human Action Recognition in Still Images

Figure 2

The overall framework of our proposed algorithm. First, the original input (1) is converted into the normalized input (2). Our proposed NCNN module is then attached to the top of a pretrained model (e.g., VGG16), and the resulting NCNN-based model is trained to carry out NCNN-based learning (3). To take advantage of ensemble learning, the DELWO model (4) fuses multiple different types of models to directly explore more aspects of the “truth.” Finally, the DELVS model (5) integrates multiple classifiers to obtain the final prediction using different voting strategies.
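
As a rough illustration of the voting step (5), the sketch below contrasts two common voting strategies: majority (hard) voting and probability-averaging (soft) voting. The caption does not say which strategies DELVS uses, so the function names `hard_vote` and `soft_vote` and the example probabilities are assumptions made for illustration and are not the authors' implementation.

```python
from collections import Counter


def hard_vote(probs):
    """Majority vote for one image.

    probs[m][c] is the probability that (hypothetical) classifier m
    assigns to class c. Each classifier votes for its top class.
    """
    votes = [max(range(len(p)), key=p.__getitem__) for p in probs]
    # most_common keeps first-seen order among equal counts, so ties
    # go to the class voted for by the earliest classifier.
    return Counter(votes).most_common(1)[0][0]


def soft_vote(probs):
    """Average the class probabilities over classifiers, then pick the top class."""
    n_models = len(probs)
    n_classes = len(probs[0])
    mean = [sum(p[c] for p in probs) / n_models for c in range(n_classes)]
    return max(range(n_classes), key=mean.__getitem__)


# Illustrative (made-up) outputs of three classifiers over three action classes.
probs = [
    [0.9, 0.1, 0.0],  # classifier 1: confident in class 0
    [0.4, 0.6, 0.0],  # classifier 2: weakly prefers class 1
    [0.4, 0.6, 0.0],  # classifier 3: weakly prefers class 1
]

print("hard vote:", hard_vote(probs))
print("soft vote:", soft_vote(probs))
```

In this made-up example the two strategies disagree: two of the three classifiers pick class 1, while the averaged probabilities favor class 0 because one classifier is highly confident. This is one reason an ensemble might compare several voting strategies.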