Research Article

PointTransformer: Encoding Human Local Features for Small Target Detection

Figure 5

(a) The original YOLOV5 head-layer is shown on the left. (b) Our proposed head-layer with positional features mapping is shown on the right. The dependence of the model on local features is further enhanced by mapping the positional features to the output layer.
(a)
(b)