Research Article

Designing Compact Convolutional Filters for Lightweight Human Pose Estimation

Table 1

Specification for MobilePoseNet. SE denotes whether there is a squeeze-and-excite in that block. NL denotes the type of nonlinearity used. Here, HS denotes h-swish and RE denotes relu. LPB is our proposed lightweight upsampling block. bneck is the bottleneck block in MobileNetV3. is the number of key points.

Input channelInput sizeOperatorExp size#outAttentionNL

3Conv2d16HS2
16bneck, 1616RE1
16bneck, 6424RE2
24bneck, 7224RE1
24bneck, 7240SERE2
40bneck, 12040SERE1
40bneck, 12040SERE1
40bneck, 24080HS2
80bneck, 20080HS1
80bneck, 18480HS1
80bneck, 18480HS1
80bneck, 480112SEHS1
112bneck, 672112SEHS1
112bneck, 672160SEHS1
160bneck, 960160SEHS1
160bneck, 960160SEHS1
160LPB, 320160SERE2
160LPB, 320160SERE2
160Conv2d RE1