Research Article

Research on Pedestrian Detection Algorithm Based on MobileNet-YoLo

Table 3

MobileNetv3 structural parameters.

Input1Operator2Exp size3Out4SE5NL6S7

Conv2d16HS2
Bottleneck, 3 × 31616RE1
Bottleneck, 3 × 36424RE2
Bottleneck, 3 × 37224RE1
Bottleneck, 5 × 57240RE2
Bottleneck, 5 × 512040RE1
Bottleneck, 5 × 512040RE1
Bottleneck, 3 × 324080HS2
Bottleneck, 3 × 320080HS1
Bottleneck, 3 × 318480HS1
Bottleneck, 3 × 318480HS1
Bottleneck, 3 × 3480112HS1
Bottleneck, 3 × 3672112HS1
Bottleneck, 5 × 5672160HS2
Bottleneck, 5 × 5960160HS1
Bottleneck, 5 × 5960160HS1

1Input represents the shape change of each feature layer; 2Operator represents the block structure that each feature layer is about to experience; 3Exp size, 4Out represent the number of channels that rise in the inverse residual structure within the neck, and the number of channels in the feature layer at the time of input to the neck, respectively; 5SE represents whether the attention mechanism is introduced at this layer; 6NL represents the type of activation function, HS represents h-swish, and RE represents RELU; 7S represents the step length used for each block structure.