Research Article

Leukemia can be Effectively Early Predicted in Routine Physical Examination with the Assistance of Machine Learning Models

Table 4

Accuracy scores of model performances on different scenarios, train-set/test-set ratios, and regularizations. (Strain and Stest are accuracy scores of models on train data subset and test data subset respectively according to the score() function of sklearn; Sauc is the area under the curve score of model according to the roc_auc_score() function of sklearn; Rtrain/total is the ratio of train data to the total data) for random forest modelfor linear SVM model (C is the overfitting suppression factor).

CRtrain/total0.750.50.25
ScenarioABCDEABCDEABCDE

0.01Strain0.84490.79350.7710.82340.86870.8710.88050.9150.9170.92930.92310.91360.92620.92250.9365
Stest0.86140.79990.76680.82970.88540.88410.87720.89430.90870.91820.93340.91370.91190.91350.9288
Sauc0.50.50.54790.71410.83150.57670.69460.79710.84780.88040.75780.79350.82840.8540.8957

1Strain0.98260.96490.9610.96090.96660.9840.97160.97230.97270.97550.9850.97520.9740.97350.9755
Stest0.98110.96760.96430.96640.96810.98260.97160.96660.97210.97270.9840.9730.97230.97740.9788
Sauc0.93350.91990.93140.94590.95390.93650.92950.93590.95520.96050.94210.93540.94920.96310.969

100Strain0.99270.98510.98740.98570.98550.99490.98380.98990.98990.98940.99420.9860.98860.98770.9899
Stest0.98550.98150.98020.97660.97590.99130.98580.98110.98040.98110.99270.98650.98740.98450.98
Sauc0.95350.96060.96920.9650.96870.97490.9710.96760.96960.97430.98690.97480.98030.97510.973