Research Article

JGRCAN: A Visual Question Answering Co-Attention Network via Joint Grid-Region Features

Table 3

Experiment results compared with other state-of-the-art models on the test-dev and test-std sets.

AccuracyTest-devTest-std
Methods (%)AllY/NNumberOtherAll

Bottom-up [3]65.3281.8244.2156.0565.67
BAN [16]69.5285.3150.9360.2669.84
BAN + counter [16]70.0485.4254.0460.5270.35
MuRel [29]68.7684.7749.8457.8568.41
DFAF [30]70.2286.0953.3260.4970.34
MCAN [2]70.6386.8253.2660.7270.90
JGRCAN70.8786.9753.1561.4571.18