Research Article

Parallel Cleaning Algorithm for Similar Duplicate Chinese Data Based on BERT

Figure 4. MapReduce processing framework.