Research Article

Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition

Figure 2

Architecture of the backbone network. The green blocks are input or output nodes (size labeled inside). The shallow blue blocks are the Conv operating set, and the deep blue blocks are the fusion process. The orange blocks are the maximum pooling layers. The values in parentheses indicate the filter size and stride.