Research Article
Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets
Algorithm 1
Selecting data by using MST clustering algorithm
.
| (1) Let , is the number of data points | | to be selected, is the size of dataset . | | (2) Use Prime method to construct MST of . | | (3) Foreach edge in MST do | | (4) Compute edge’s inconsistent value . | | (5) End Foreach | | (6) Sort all edges in descending order according to . | | (7) Insert the sorted edges into a list: edgesLst. | | (8) Foreach edge in edgesLst do | | (9) Delete edge from MST | | (10) Check the number of partitions in MST, num | | (11) If num == m then | | (12) Generate num clusters from MST | | (13) Break | | (14) End If | | (15) End Foreach | | (16) Foreach cluster T in do | | (17) Compute density of each point in T | | (18) Select one data with max density and add it to | | (19) End Foreach | | (20) Query oracle about labels of data in . | | (21) Return and . |
|