Research Article

Content Deduplication with Granularity Tweak Based on Base and Deviation for Large Text Dataset

Table 6

Deduplication elimination ratio.

Size (MB)NM-DERGD-DER

501.8518518522.057613169
421.7895185341.88781014
361.3333333331.388888889
231.7692307691.965811966
151.6666666671.984126984
91.2820512821.780626781
57.0422535217.30994152
12.0408163272.267573696