Research Article
Person Retrieval in Video Surveillance Using Deep Learning–Based Instance Segmentation
Figure 2
Architecture of the YOLACT++ model. It uses ResNet-101 with a feature pyramid network (FPN) as the feature backbone and contains a classification structure (top branch) and a semantic segmentation structure (bottom branch) for the entire image.
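To illustrate how the outputs of the two branches can be combined into a per-person mask, the following minimal sketch follows the mask assembly step of the original YOLACT design: the image-wide branch yields k prototype maps, the per-detection branch yields k coefficients alongside the class scores, and the instance mask is the sigmoid of their linear combination. That the bottom branch in the figure corresponds to this prototype branch is an assumption, and the function names `assemble_mask` and `binarize`, the 0.5 threshold, and the toy 2 x 2 values are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function used to squash the combined prototype response."""
    return 1.0 / (1.0 + math.exp(-x))


def assemble_mask(prototypes, coefficients):
    """Combine k prototype maps (each H x W) with k coefficients.

    Hypothetical helper: returns an H x W map of values in (0, 1),
    computed as sigmoid(sum_i coefficients[i] * prototypes[i]).
    """
    if len(prototypes) != len(coefficients):
        raise ValueError("need one coefficient per prototype map")
    height = len(prototypes[0])
    width = len(prototypes[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            combined = sum(
                c * p[y][x] for c, p in zip(coefficients, prototypes)
            )
            row.append(sigmoid(combined))
        mask.append(row)
    return mask


def binarize(mask, threshold=0.5):
    """Turn the soft mask into a 0/1 mask (threshold is an assumed value)."""
    return [[1 if v > threshold else 0 for v in row] for row in mask]


# Toy example: two 2 x 2 prototype maps and one detection's coefficients.
prototypes = [
    [[1.0, 1.0], [0.0, 0.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
coefficients = [2.0, -2.0]

soft_mask = assemble_mask(prototypes, coefficients)
print(binarize(soft_mask))  # prints [[1, 0], [0, 0]]
```

In a full pipeline the assembled mask would also be cropped to the predicted bounding box of the detection; that step is omitted here to keep the sketch short.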