JGRCAN: A Visual Question Answering Co-Attention Network via Joint Grid-Region Features

<div>Visualization of the attention mechanism in JGRCAN. The first row of the figure shows the input image and question; the second row shows the important information attended to by the region features; the third row shows the important information attended to by the grid features. The important information is circled in red.</div>

Mathematical Problems in Engineering

Figure 8: JGRCAN: A Visual Question Answering Co-Attention Network via Joint Grid-Region Features