Research Article

[Retracted] Analyzing the Effect of Masking Length Distribution of MLM: An Evaluation Framework and Case Study on Chinese MRC Datasets

Table 3

Statistics of the proposed span extraction datasets.

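| Statistic | Short span extraction: Train set | Short span extraction: Dev set | Short span extraction: Test set | Long span extraction: Train set | Long span extraction: Dev set | Long span extraction: Test set |
| --- | --- | --- | --- | --- | --- | --- |
| Paragraph # | 600 | 150 | 200 | 600 | 150 | 200 |
| Passage # | 15,265 | 2770 | 2484 | 7756 | 1163 | 1565 |
| Question # | 31,390 | 6774 | 8309 | 12,053 | 2376 | 5812 |
| Max tokens in a context # | 512 | 512 | 512 | 512 | 512 | 512 |
| Max answer tokens # | 6 | 6 | 6 | 9 | 9 | 9 |
| Min answer tokens # | 4 | 4 | 4 | 7 | 7 | 7 |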