Research Article

Practical Skills of Business English Correspondence Writing Based on Data Mining Algorithm

Table 3

Sample statistics of training set and test set.

 Training setTest set

Total number of labeled samples24052145
Number of samples in category I535520
Category I sample proportion0.220.24
Number of samples in category II415365
Category II sample proportion0.160.16
Number of samples in category III920795
Category III sample proportion0.320.36
Number of samples in category IV530466
Category IV sample proportion0.220.21