Research Article

PointTransformer: Encoding Human Local Features for Small Target Detection

Table 4

Comparison with state-of-the-art (SOTA) models.

Method                              AP    AP50  AP75  APs   APm   APl
FCOS                                28.1  39.5  31.3  16.5  35.7  48.1
EfficientNet-B0-based EfficientDet  27.9  38.6  30.8  16.8  34.5  46.6
EfficientNet-B3-based EfficientDet  30.8  40.8  33.6  17.0  36.2  49.1
Transformer-based Deformable DETR   33.5  42.7  36.1  19.4  41.5  51.7
YOLOv5-X                            31.4  40.1  33.2  17.3  38.7  53.5
Our proposed model                  37.2  45.5  40.7  29.3  42.1  43.2