Journal of Sensors

Research Article

Person Retrieval in Video Surveillance Using Deep Learning–Based Instance Segmentation

Figure 2

Architecture of YOLACT++ model. It uses ResNet-101 with FPN as the feature backbone, and it contains a classification structure (top branch) and a semantic segmentation structure (bottom branch) for the entire image.