Research Article
Content Deduplication with Granularity Tweak Based on Base and Deviation for Large Text Dataset
Table 6
Deduplication elimination ratio.
| Size (MB) | NM-DER | GD-DER |
| 50 | 1.851851852 | 2.057613169 | 42 | 1.789518534 | 1.88781014 | 36 | 1.333333333 | 1.388888889 | 23 | 1.769230769 | 1.965811966 | 15 | 1.666666667 | 1.984126984 | 9 | 1.282051282 | 1.780626781 | 5 | 7.042253521 | 7.30994152 | 1 | 2.040816327 | 2.267573696 |
|
|