Research Article
Machine Reading Comprehension-Enabled Public Service Information System: A Large-Scale Dataset and Neural Network Models
Table 1
The statistics of the C-Pulse dataset.
| Statistic | Train set | Dev set | Test set |
| --- | --- | --- | --- |
| Context # | 8,845 | 1,249 | 1,249 |
| Question # | 16,000 | 2,000 | 2,000 |
| Max context tokens # | 897 | 900 | 899 |
| Avg context tokens # | 373 | 407 | 415 |
| Min context tokens # | 130 | 115 | 123 |
| Max question tokens # | 97 | 109 | 77 |
| Avg question tokens # | 24 | 23 | 23 |
| Min question tokens # | 5 | 7 | 7 |
| Max answer tokens # | 487 | 401 | 682 |
| Avg answer tokens # | 30 | 31 | 31 |
| Min answer tokens # | 2 | 2 | 2 |