Research Article
CNN with Embedding Transformers for Person Reidentification
Figure 4
Structure of TIC. TIC embeds an RT structure between conv2_x and conv3_x, conv3_x and conv4_x of ResNet-50, respectively. In the RT structure, first, the tokenizer maps the feature map Gin to be token vector Tt; then, it uses the transformer to get the output token vector Tout; finally, it uses the projector to reconstruct the feature map to be Gout.