Research Article

Content Deduplication with Granularity Tweak Based on Base and Deviation for Large Text Dataset

Figure 17

Hadoop architecture.