Research Article

TCMNER and PubMed: A Novel Chinese Character-Level-Based Model and a Dataset for TCM Named Entity Recognition

Table 5

Performance (precision P, recall R, and F1, all in %) of the different models on the publications test set and on the entire medical records set.

                    Publications (test), %      Medical records, %
Model               P       R       F1          P       R       F1

BiLSTM-CRF          60.8    30.6    40.7        61.4    48.3    54.1
BERT-CRF            58.7    54.2    56.4        54.7    60.5    57.4
BERT-BiLSTM         93      91.6    90.3        85.5    88.1    86.8
BERT-BiLSTM-CRF     75.4    73.1    74.2        69      75.2    71.9
RoBERTa-BiLSTM      88.8    92.6    90.7        86.3    90.1    88.2
RoBERTa-c           92.6    96.7    94.6        90.4    92.3    91.3
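
As a reading aid for the table, the sketch below shows how an F1 value relates to its precision and recall, assuming F1 is the usual harmonic mean of P and R (the table itself does not restate the definition). The function name `f1_score_percent` is a hypothetical helper introduced only for this illustration. The input values are the RoBERTa-c figures from Table 5.

```python
def f1_score_percent(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given in percent.

    Assumption: F1 = 2PR / (P + R), the standard definition.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are 0
    return 2 * precision * recall / (precision + recall)


# RoBERTa-c row of Table 5: (P, R) on each evaluation set.
publications_f1 = round(f1_score_percent(92.6, 96.7), 1)
medical_records_f1 = round(f1_score_percent(90.4, 92.3), 1)

print(publications_f1)     # 94.6
print(medical_records_f1)  # 91.3
```

Under this assumption, both computed values match the F1 column reported for RoBERTa-c.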