Research Article

Content Deduplication with Granularity Tweak Based on Base and Deviation for Large Text Dataset

Table 2

Document vector space ().

Document vector space ()Topic_1Topic_2Topic_3

D10.400.000.00
D20.970.000.00
D30.000.810.00
D40.000.780.00
D50.000.000.82
D60.000.000.82
D70.890.000.00
D80.000.630.00
D9−4.46139E−177.64073E−164.36267E−16