Research Article

An Enhanced Machine Learning Framework for Type 2 Diabetes Classification Using Imbalanced Data with Missing Values

Table 1

Summary of machine learning solutions for diabetes classification.

StudyPreprocessing approachAlgorithmSignificance

Guldogan et al. [16]Missing data deletedMultilayer perceptron (MLP) and radial basis function (RBF)MLP giving an accuracy of 78.1%
Alam et al. [17]Median value imputationANN, random forest, K-means clusteringANN giving an accuracy of 75.7%
Wang et al. [18]naïve Bayes imputation, ADASYN oversamplingRandom forest87.10% accuracy
Sarwar et al. [19]Method details not statedK-nearest neighbors, naïve Bayes, support vector machine, decision tree, logistic regression, random forestK-nearest neighbors and support vector machine giving accuracy 77%
Woldemichael and Menaria [20]Mean value imputationBackpropagation, support vector machine, J48, naïve BayesBackpropagation giving an accuracy of 83.11%
Sisodia and Sisodia [21]Method details not statedDecision tree, support vector machine, naïve BayesNaïve Bayes giving an accuracy of 76.30%
Wu et al. [22]Mean value imputationK-means + logistic regression95.42% accuracy
Komi et al. [13]Method details not statedGaussian mixture model, support vector machine, logistic regression, extreme learning machine, ANNANN giving an accuracy of 89.0%
AlThunayan et al. [15]Method details not statedBayesian, naïve Bayes, J48, random tree, random forest, REP tree, FT tree, CART, SMORandom forest giving an accuracy of 84.11%
Vaishali et al. [23]Missing data kept unchangedNaïve Bayes, J48, MLP, MOE fuzzy classifierMOE fuzzy giving accuracy of 83.04%
Geman et al. [24]Missing data kept unchangedHybrid adaptive neural network84.27% accuracy
Ramesh et al. [25]Missing data kept unchangedRecurrent deep neural Network (RNN)0.145 error rate
Soltani and Jafarian [14]Missing data kept unchangedANN types: PNN, MLP, RBF, GRNNPNN giving an accuracy of 81.49%
Ahmad et al. [2]Mean value imputationMLP, ID3, J48J48 giving an accuracy of 89.3%
AlJarullah [11]Missing data deletedJ4878.17% accuracy
Marcano-Cedeño et al. [26]Missing data deletedArtificial metaplasticity on multilayer perceptron algorithm (AMMLP)89.93% accuracy
Patil et al. [3]Missing data deletedK-means + C4.592.38% accuracy
Lekkas and Mikhailov [27]Missing data deletedRule-based fuzzy classification79.37% accuracy
Kahramanli and Allahverdi [28]Method details not statedANN + fuzzy neural network84.24% accuracy
Han et al. [29]Missing data deletedID3, decision treeID3 giving an accuracy of 80.0%