Research Article

Curvature-Driven Deformable Convolutional Networks for End-To-End Object Detection

Table 1

Detection results on PASCAL VOC 2007 test set. The detectors are Faster R-CNN and YOLOv4.

MethodShorter side (600)Faster R-CNNYOLOv4

BaselineRegular69.778.662.570.579.863.4
Deformationdconv@c3c5+71.881.465.373.082.565.9
dpool(DCNv1) [6]
Modulated deformationmdconv@c3c5+73.783.868.574.984.369.6
Mdpool [7]
Curvature-drivenC-dconv@c3c5+75.285.471.676.586.273.5
DeformationCdpool

The input images are of shorts side 600 pixels. In the setting column, “(m)dconv” and “(m)dpool” stand for (modulated) deformable convolution and (modulated) deformable RoIpooling, respectively. “C-dconv” and “C-dpool” stand for curvature-driven deformable convolution and curvature-driven deformable RoIpooling, And dconv@c3c5 stands for applying deformable conv layers at stages conv3conv5, C-dconv@c3c5″ stands for applying curvature-driven deformable conv layers at stages conv3conv5. The bold value means the best value of each item.