Research Article
Similarity Measurement and Classification of English Characters Based on Language Features
Table 1
Average similarity of vocabulary pairs with status 1 under different data numbers of different algorithms.
| Number of data | 500 | 1000 | 1500 | 2000 | 2500 | 3000 | 3500 | 4000 | 4500 | 5801 |
| method1 | 0.717 | 0.742 | 0.728 | 0.731 | 0.730 | 0.729 | 0.731 | 0.731 | 0.730 | 0.730 | method2 | 0.713 | 0.718 | 0.721 | 0.725 | 0.724 | 0.722 | 0.724 | 0.724 | 0.723 | 0.723 | method3 | 0.373 | 0.378 | 0.376 | 0.378 | 0.378 | 0.377 | 0.376 | 0.380 | 0.378 | 0.380 | method4 | 0.649 | 0.652 | 0.658 | 0.661 | 0.663 | 0.662 | 0.664 | 0.664 | 0.663 | 0.664 | method5 | 0.841 | 0.837 | 0.839 | 0.841 | 0.845 | 0.844 | 0.846 | 0.846 | 0.846 | 0.846 |
|
|