Research Article

Research on Intelligent Video Detection of Small Targets Based on Deep Learning Intelligent Algorithm

Table 1

Brief introduction and comparison of lightweight networks.

Model nameReference volume/106Basic introductionClassification effect on ImageNet/%

MobileNet v14.20Lightweight network model that can be used for mobile, using deep separable convolution instead of normal convolution; the number of parameters and the amount of computation are greatly reduced, but the straight-cylinder structure is not sufficient for feature learning70.6
MobileNet v23.40The hourglass residual structure is introduced to enhance the gradient propagation and reduce the computation, and the ReLU function is removed from the last layer to preserve the feature diversity72.0
MobileNet v35.40The model structure is improved by using the network structure search algorithm, while the SE module is introduced to strengthen the network learning ability by combining with the channel attention mechanism, and the h-swish activation function is proposed to improve the accuracy75.2
ShuffleNet v11.90The point-by-point group convolution is used to reduce the computational complexity of 1 × 1 convolution, and the channel transformation method is proposed to induce the flow of information in the same channel of different features, but the difference between the number of input and output channels is still too large to affect the efficiency67.8
ShuffleNet v22.30Abandon group convolution and introduce channel splitting operation to reduce the number of network branches to obtain faster detection speed with certain accuracy69.4
ShuffleNet1.25Replace the 3 × 3 convolution with 1 × 1 convolution, reduce the number of convolution channels, and postpone the sampling operation to significantly reduce the number of parameters and the amount of computation so as to maximize the speed increase in exchange for the reduction in accuracy57.5