Research Article

An Efficient and Effective Model to Handle Missing Data in Classification

Table 1

Specifications of real-world datasets.

Dataset nameSample sizeVariable numberDiscrete variable numberMissing proportionImbalance

Breast Cancer Wisconsin [47]6991002.2965.5
Chronic kidney disease400241360.562.5
Congressional voting records435161646.6761.4
Credit approval6901595.3655.5
Cylinder bands540391948.757.8
Heart disease—ungarian29413799.6663.9
Hepatitis155191348.3979.4
Horse colic368231598.163
Mammographic mass [48]9615213.6353.7
Ozone level detection253673027.1397.1