Research Article

[Retracted] Analyzing the Effect of Masking Length Distribution of MLM: An Evaluation Framework and Case Study on Chinese MRC Datasets

Table 7

The number of questions of each MRC dataset.

DatasetQuestion #Train question #Dev question #Test question #Percentage of the train set

SQuAD2.0151,054130,31911,873886286.27%
SQuAD1.1107,70287,59910,570953381.33%
TQA26,26015,1545309579757.71%
MovieQA21,40614,1662844439666.18%
MCScript13,93997311411279769.81%
DREAM10,19761162040204159.98%
ARC-E51972251570237643.31%
WikiQA3047211829663369.51%
ARC-C25901119299117243.20%
ProPara488391544380.12%
Short span46,47331,3906774830967.54%
Long span20,24112,0532376581259.55%
Short cloze58,50040,5009000900069.23%
Long cloze58,50040,5009000900069.23%