Research Article

Designing Compact Convolutional Filters for Lightweight Human Pose Estimation

Table 5

Inference speed comparisons on the MSCOCO validation set. Speed refers to the result on non-GPU device. Speed refers to the result on GPU device. Bold values are the optimal results.

MethodBackbone#ParamsGFLOPsInput sizeAPSpeedSpeed

HRNetHRNetV1-W3228.5M7.174.47.519.2
HRNetHRNetV1-W3228.5M1675.8418.8
SimpleBaselineResNet-5034.0M8.970.48.1273.1
NLite-HRNet-18HRNet-W160.7M0.1962.81118.9
WNLite-HRNet-18HRNet-W161.3M0.3661218.6
ShuffleNetV2 1×ShuffleNetV27.6M1.2859.91771.3
ShuffleNetV2 1×ShuffleNetV27.6M2.8763.61064.1
MobileNetV2 1×MobileNetV29.6M1.4864.66.883.1
MobileNetV2 1×MobileNetV29.6M3.3367.34.573.1
Lite-HRNetLite-HRNet-181.1M0.264.81217.4
Lite-HRNetLite-HRNet-181.1M0.4567.67.116.3
MobilePoseNetMobileNetV31.5M0.5566.27.854.8
MobilePoseNetMobileNetV31.5M1.2369.05.150.8