Research Article
High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak
Table 3
Run time, ratio, and standard deviation of each dimensionality reduction method when reducing the dimension to 2,000, 500, and 100.
| Dataset | Method | 2,000: Time (s) | 2,000: Ratio | 2,000: Standard deviation | 500: Time (s) | 500: Ratio | 500: Standard deviation | 100: Time (s) | 100: Ratio | 100: Standard deviation |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| BBC | PCA | 30.52 | 1.00 | 0.02 | 15.16 | 0.57 | 0.09 | 4.99 | 0.23 | 0.09 |
| BBC | MDS | 57.99 | 1.00 | 0.02 | 28.67 | 1.00 | 0.07 | 15.47 | 1.00 | 0.07 |
| BBC | RP | 1.12 | 1.00 | 0.04 | 0.56 | 1.00 | 0.08 | 0.41 | 1.01 | 0.18 |
| BBC | SRP | 2.87 | 1.00 | 0.05 | 2.38 | 1.00 | 0.07 | 2.27 | 1.00 | 0.12 |
| 20-newsgroups | PCA | 24.87 | 1.00 | 0.00 | 13.35 | 1.00 | 0.08 | 4.20 | 1.00 | 0.1 |
| 20-newsgroups | MDS | 65.27 | 1.00 | 0.02 | 44.74 | 1.00 | 0.03 | 23.13 | 0.99 | 0.06 |
| 20-newsgroups | RP | 0.10 | 0.99 | 0.06 | 0.46 | 1.00 | 0.12 | 0.31 | 0.99 | 0.28 |
| 20-newsgroups | SRP | 3.33 | 1.00 | 0.05 | 2.85 | 1.00 | 0.07 | 2.72 | 1.00 | 0.15 |
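As a rough illustration of how figures like those in Table 3 could be produced, the sketch below times a Gaussian random projection (RP) and measures how well it preserves pairwise distances. It rests on several assumptions that are not confirmed by the text here: that "ratio" means the mean of projected-to-original pairwise Euclidean distance ratios, that "standard deviation" is the spread of those ratios, and that the projection matrix is dense Gaussian. The function names, the synthetic data, and the target dimensions are hypothetical and chosen only to keep the example small; this is not the authors' implementation.

```python
import math
import random
import statistics
import time


def gaussian_random_projection(rows, target_dim, seed=0):
    """Project each row onto target_dim random Gaussian directions.

    Entries are drawn from N(0, 1/target_dim) so that squared Euclidean
    distances are preserved in expectation.
    """
    rng = random.Random(seed)
    source_dim = len(rows[0])
    scale = 1.0 / math.sqrt(target_dim)
    matrix = [[rng.gauss(0.0, scale) for _ in range(source_dim)]
              for _ in range(target_dim)]
    return [[sum(w * x for w, x in zip(direction, row)) for direction in matrix]
            for row in rows]


def distance_ratio_stats(original, projected):
    """Mean and standard deviation of projected/original pairwise distances."""
    ratios = []
    for i in range(len(original)):
        for j in range(i + 1, len(original)):
            d_orig = math.dist(original[i], original[j])
            if d_orig > 0.0:  # skip identical points to avoid dividing by zero
                ratios.append(math.dist(projected[i], projected[j]) / d_orig)
    return statistics.mean(ratios), statistics.pstdev(ratios)


def benchmark(rows, target_dim, seed=0):
    """Return (seconds, mean ratio, std of ratios) for one projection run."""
    start = time.perf_counter()
    projected = gaussian_random_projection(rows, target_dim, seed)
    elapsed = time.perf_counter() - start
    mean_ratio, std_ratio = distance_ratio_stats(rows, projected)
    return elapsed, mean_ratio, std_ratio


# Synthetic stand-in data (NOT the BBC or 20-newsgroups corpora).
data_rng = random.Random(42)
toy_rows = [[data_rng.random() for _ in range(200)] for _ in range(30)]

# Hypothetical target dimensions, scaled down from the 2,000/500/100 in Table 3.
for dim in (100, 50, 10):
    seconds, mean_ratio, std_ratio = benchmark(toy_rows, dim)
    print(f"dim={dim:>3}  time={seconds:.4f}s  ratio={mean_ratio:.2f}  std={std_ratio:.2f}")
```

Under these assumptions, the mean ratio should stay near 1 while the spread of ratios grows as the target dimension shrinks, which is the same qualitative pattern the RP and SRP rows show in the table.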