Research Article

PointTransformer: Encoding Human Local Features for Small Target Detection

Table 1

Influence of different backbone designs on feature encoding.

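| Backbone design | AP | AP50 | AP75 | APs | APm | APl |
|---|---|---|---|---|---|---|
| YOLOV5-X | 31.4 | 40.1 | 33.2 | 17.3 | 38.7 | 53.5 |
| Transformer encoder | 31.6 | 40.4 | 33.7 | 18.6 | 39.6 | 49.8 |
| Self-attention and cross-attention | 34.5 | 42.1 | 37.1 | 24.2 | 42.1 | 45.6 |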