Research Article

An Ensemble Semantic Textual Similarity Measure Based on Multiple Evidences for Biomedical Documents

Table 4

Description of the data set.

cluster_numdoc_num_clusterdoc_num_data

Min31084
Max123851541
Average6.988.4609.4

Note: cluster_num represents the number of clusters in the 100 data sets, doc_num_cluster represents the number of documents contained in each cluster, and doc_num_data represents the number of documents contained in each data set.