Research Article
TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition
Table 4
Number of entities in each of the 5 classes.
| Dataset | Clinical manifestation | Syndrome | Disease | Treatment law | Herb | Total entities | Number of samples |
|---|---|---|---|---|---|---|---|
| Publications | 18,150 | 9,043 | 1,327 | 10,698 | 9,689 | 48,907 | 79,579 |
| Medical records | 75,177 | 37,053 | 1,428 | 9,968 | 57,836 | 181,462 | 14,801 |