Research Article

Machine Reading Comprehension-Enabled Public Service Information System: A Large-Scale Dataset and Neural Network Models

Table 2

The number of questions of each MRC dataset.

DatasetQuestion #Train Qu. #Dev Qu. #Test Qu. #Percentage of the train set

SQuAD2.0151,054130,31911,8738,86286.27
MCScript13,9399,7311,4112,79769.81
TQA26,26015,1545,3095,79757.71
SQuAD1.1107,70287,59910,5709,53381.33
MovieQA21,40614,1662,8444,39666.18
OpenBookQA5,9574,95750050083.21
DREAM10,1976,1162,0402,04159.98
WikiQA3,0472,11829663369.51
ARC-E5,1972,2515702,37643.31
ProPara488391544380.12
ARC-C2,5901,1192991,17243.20
C-Pulse20,00016,0002,0002,00080.00