Research Article

RGB-D Human Action Recognition of Deep Feature Enhancement and Fusion Using Two-Stream ConvNet

Table 3

Comparison of accuracy of adding nonlocal to different locations of st-gcn.

NetworkTop 1Top 5

Baseline81.5%
1-block85.23%98.74%
2-block86.43%98.62%
3-block85.43%97.12%
4-block82.14%97.2%
5-block85.55%97.31%
1-2-block85.63%96.32%
1-3-block84.08%95.67%
1-4-block84.24%92.35%
2-2-block87.62%97.3%
2-3-block84.1%95.2%
2-4-block84.41%94.69%
3-3-block83.77%94.12%
3-4-block80.19%91.63%
4-4-block77.09%91.03%
5-5-block77.75%90.12%