Research Article

[Retracted] Analyzing the Effect of Masking Length Distribution of MLM: An Evaluation Framework and Case Study on Chinese MRC Datasets

Table 4

Statistics of the proposed multiple-choice cloze datasets.

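| Statistic | Short multiple-choice cloze (train set) | Short multiple-choice cloze (dev set) | Short multiple-choice cloze (test set) | Long multiple-choice cloze (train set) | Long multiple-choice cloze (dev set) | Long multiple-choice cloze (test set) |
| --- | --- | --- | --- | --- | --- | --- |
| Passages # | 4500 | 1000 | 1000 | 4500 | 1000 | 1000 |
| Blanks # | 40,500 | 9000 | 9000 | 40,500 | 9000 | 9000 |
| Max tokens in a passage # | 1000 | 1000 | 1000 | 1000 | 1000 | 1000 |
| Max answer tokens # | 14 | 14 | 14 | 29 | 29 | 29 |
| Min answer tokens # | 7 | 7 | 7 | 17 | 17 | 17 |
| Options # | 9 | 9 | 9 | 9 | 9 | 9 |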