A Novel Pyramid Network with Feature Fusion and Disentanglement for Object Detection

<div>The overview of the proposed FFAD cooperated with single-stage detector. The features of input images are first extracted by the backbone network, and then MF<sup>3</sup>M fuse these features through multiple paths. Finally, TaConv produces a multilevel feature pyramid. There are two parallel feature maps used to predict specific categories and regress precise boxes, respectively, at each level.</div>

Computational Intelligence and Neuroscience

fig2

Figure 2

Figure 2: A Novel Pyramid Network with Feature Fusion and Disentanglement for Object Detection