Research Article

Leukemia can be Effectively Early Predicted in Routine Physical Examination with the Assistance of Machine Learning Models

Table 2

Accuracy scores of model performances on different scenarios, train-set/test-set ratios, and regularizations. (Strain and Stest are accuracy scores of models on train data subset and test data subset respectively according to the score() function of sklearn; Sauc is the area under the curve score of model according to the roc_auc_score() function of sklearn; Rtrain/total is the ratio of train data to the total data) for random forest model.

Rtrain/totalScenarioABCDE

0.75Strain11111
Stest0.99510.98780.9870.98340.9844
Sauc0.98550.97470.97790.97970.9808

0.5Strain11111
Stest0.99490.98710.98550.98160.9855
Sauc0.98590.97440.97450.97670.9808

0.25Strain11111
Stest0.99560.98510.98990.98690.9822
Sauc0.98860.96920.98360.98030.9755