Research Article
Multiscale Meets Spatial Awareness: An Efficient Attention Guidance Network for Human Parsing
Figure 2
An overview of the Attention Guidance Network (AG-Net). The network is built on an encoder-decoder architecture that extracts features at four resolution layers, and the Attention RefineNet learns attention score maps to refine the predicted labels. We append the Attention SPP to the end of the encoder and use a multiscale supervision strategy to refine the model.
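The multiscale supervision mentioned in the caption can be illustrated with a generic deep-supervision loss: each intermediate prediction is compared against the ground-truth label map downsampled to the same resolution, and the per-scale losses are summed. The following is a minimal sketch under stated assumptions, not the AG-Net loss itself. The caption does not specify the loss function, the scales, or the weights, so the function names (`downsample_labels`, `one_hot`, `cross_entropy`, `multiscale_loss`), the nearest-neighbour downsampling, the cross-entropy criterion, and the uniform default weights are all hypothetical choices made for illustration.

```python
import math


def downsample_labels(labels, factor):
    """Nearest-neighbour downsampling of a 2D label map by an integer factor."""
    return [row[::factor] for row in labels[::factor]]


def one_hot(labels, num_classes):
    """Turn a 2D label map into per-pixel one-hot probability vectors."""
    return [
        [[1.0 if c == label else 0.0 for c in range(num_classes)] for label in row]
        for row in labels
    ]


def cross_entropy(probs, labels):
    """Mean pixel-wise cross-entropy; probs[y][x] is a list of class probabilities."""
    total, count = 0.0, 0
    for prob_row, label_row in zip(probs, labels):
        for pixel_probs, label in zip(prob_row, label_row):
            # Clamp to avoid log(0) for confidently wrong predictions.
            total += -math.log(max(pixel_probs[label], 1e-12))
            count += 1
    return total / count


def multiscale_loss(predictions, labels, weights=None):
    """Weighted sum of per-scale losses.

    predictions maps a downsampling factor (assumed integer) to the
    probability map predicted at that scale. Weights default to 1.0
    per scale, which is an assumption, not a value from the paper.
    """
    if weights is None:
        weights = {factor: 1.0 for factor in predictions}
    loss = 0.0
    for factor, probs in predictions.items():
        target = downsample_labels(labels, factor)
        loss += weights[factor] * cross_entropy(probs, target)
    return loss


if __name__ == "__main__":
    labels = [[0, 0, 1, 1], [0, 0, 1, 1], [1, 1, 0, 0], [1, 1, 0, 0]]
    # Predictions at full resolution (factor 1) and half resolution (factor 2).
    preds = {f: one_hot(downsample_labels(labels, f), 2) for f in (1, 2)}
    print(multiscale_loss(preds, labels))
```

In a real training setup the per-scale predictions would come from the decoder stages and the weights would be tuned; here they are only placeholders that show how the terms combine.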