Research Article

Multiple Context Learning Networks for Visual Question Answering

Table 2

Results of ablating the context learning modules on the VQA v2.0 and GQA validation sets.

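| Model | Module       | GQA All | VQA v2.0 All | VQA v2.0 Y/N | VQA v2.0 Num | VQA v2.0 Other |
|-------|--------------|---------|--------------|--------------|--------------|----------------|
| 1     | Without all  | 53.08   | 54.60        | 69.79        | 36.02        | 47.50          |
| 2     | Only VCL     | 53.45   | 55.13        | 69.82        | 36.09        | 47.99          |
| 3     | Only TCL     | 53.50   | 55.53        | 69.82        | 36.44        | 49.72          |
| 4     | Only VTCL    | 58.63   | 62.07        | 79.79        | 42.67        | 53.73          |
| 5     | TCL + VTCL   | 63.88   | 65.17        | 82.88        | 44.68        | 57.15          |
| 6     | VCL + VTCL   | 59.04   | 62.72        | 79.33        | 43.29        | 55.23          |
| 7     | VCL + TCL    | 53.72   | 55.76        | 71.01        | 36.37        | 49.29          |
| 8     | Full modules | 64.48   | 65.68        | 83.40        | 45.57        | 57.53          |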
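
One way to read the Table 2 ablation is as gains over the setting with no context learning modules (model 1). The following is a minimal sketch of that arithmetic, not part of the original article. The values are transcribed from Table 2; the names `ROWS`, `COLUMNS`, and `gains_over_baseline` are hypothetical and exist only for this illustration.

```python
# Values transcribed from Table 2:
# (module setting, GQA All, VQA v2.0 All, VQA v2.0 Y/N, VQA v2.0 Num, VQA v2.0 Other)
ROWS = [
    ("Without all",  53.08, 54.60, 69.79, 36.02, 47.50),
    ("Only VCL",     53.45, 55.13, 69.82, 36.09, 47.99),
    ("Only TCL",     53.50, 55.53, 69.82, 36.44, 49.72),
    ("Only VTCL",    58.63, 62.07, 79.79, 42.67, 53.73),
    ("TCL + VTCL",   63.88, 65.17, 82.88, 44.68, 57.15),
    ("VCL + VTCL",   59.04, 62.72, 79.33, 43.29, 55.23),
    ("VCL + TCL",    53.72, 55.76, 71.01, 36.37, 49.29),
    ("Full modules", 64.48, 65.68, 83.40, 45.57, 57.53),
]
COLUMNS = ("GQA All", "VQA All", "VQA Y/N", "VQA Num", "VQA Other")


def gains_over_baseline(rows, baseline="Without all"):
    """Return, per module setting, the per-column difference from the baseline row."""
    base = next(r for r in rows if r[0] == baseline)[1:]
    return {
        name: tuple(round(v - b, 2) for v, b in zip(vals, base))
        for name, *vals in rows
    }


gains = gains_over_baseline(ROWS)

# Print one line per setting with signed differences in the table's own units.
print(f"{'Module':<12}  " + "  ".join(f"{c:>9}" for c in COLUMNS))
for name, deltas in gains.items():
    print(f"{name:<12}  " + "  ".join(f"{d:>+9.2f}" for d in deltas))
```

Read this way, the full model is 11.40 above the baseline on GQA and 11.08 above it on VQA v2.0 overall, while VCL alone or TCL alone each move the overall scores by less than 1.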