Research Article

Parallel Cleaning Algorithm for Similar Duplicate Chinese Data Based on BERT

Figure 2: Text to vector by BERT.