Abstract

Heart disease (HD) has become one of the most significant causes of mortality worldwide, and its detection typically requires an expensive and sophisticated process. Heart failure seriously threatens many people's lives through its high morbidity and mortality. Therefore, accurate prediction and diagnosis are needed for early prevention, detection, and treatment to reduce the threat to human life. However, early and accurate prediction of HD remains a challenging task. In this work, we propose a machine learning-based prediction model (MLbPM) that exploits a combination of data scaling methods, split ratios, best parameters, and machine learning algorithms for predicting HD. The performance of the proposed model is tested by performing experiments on a University of California Irvine HD dataset to indicate the presence or absence of HD. The results show that the proposed MLbPM achieves an accuracy of 96.7% when logistic regression, the robust scaler, the best parameters, and a 70 : 30 split ratio of the dataset are considered. In addition, MLbPM outperforms other compared works in terms of accuracy.

1. Introduction

Heart disease (HD) occupies the top place among causes of death across the globe. According to the World Health Organization, almost 17.9 million deaths occur every year due to HD, estimated to be 31% of deaths globally [1]. HD describes any condition affecting the heart. The current generation is continuously busy in everyday life due to technological developments. This situation causes nervousness, restlessness, and stress, leading to HD. Early detection and classification could help to decide whether a patient has HD or not, and this prediction may prevent deaths. Electrocardiography and blood tests are often required for patients with HD symptoms to evaluate the disease appropriately [2, 3]. Medical diagnosis is a complicated and vital task, and various factors make it hard for physicians to assess and detect the type of HD. The prediction of HD, when done early, has shown benefits such as allowing people to save money, preserve their health, and avoid wasting time [4–8]. To this end, an automated prediction system could be advantageous as it could be utilized by nonmedical personnel too.

Machine learning (ML) is an artificial intelligence branch that operates by analyzing data and recognizing patterns with minimal human intervention. In order to forecast new output values, ML algorithms use historical data as input. The successful use of ML provides options that can grant benefits in rural areas where doctors and expensive equipment are limited. Different studies have shown that it is feasible to develop a decision support system using data information and ML approaches and thereby improve HD prediction [9, 10]. ML algorithms are expanding, and they have shown promising results in different applications, for example, online learning [11], scheduling [12], multiobjective optimization [13], and vehicle routing [14]. ML approaches are often preferred because they allow low computational cost and reasonable memory consumption. They are most useful when applied in efficient models, and their quality also depends on the quality of the data. Crucial data preprocessing steps such as data cleaning, feature selection, and data scaling are often required to obtain standard data. While most studies have been interested in ML algorithm-based approaches with the use of feature selection [3, 9], only a few have paid attention to the combined influence of the split ratio, the best parameters, and the scaling methods on model performance [10, 15, 16]. It is challenging to decide simultaneously which split ratios, best parameters, and scaling methods are suitable for the best model when the dataset often contains features that vary in magnitude, range, and units.

To address the abovementioned challenges, this work proposes an ML-based prediction model (MLbPM) which uses the best data scaling method, split ratio, and parameters and evaluates classifiers by measuring the classification accuracy of various ML algorithms to predict HD. Specifically, this work combines four ML algorithms and three data scaling methods, with the default or best parameters and the split ratios, to find the best match for HD prediction. More specifically, we use the ML algorithms logistic regression (LR), Naive Bayes (NB), support vector machine (SVM), and k-nearest neighbors (KNN), together with the default or best parameters, the split ratios, and three data scaling methods: the standard scaler (SS), the min–max scaler (MS), and the robust scaler (RS).

The main contributions of this work are given as follows:
(1) We propose a model called MLbPM, which focuses on split ratios, parameter tuning, and data scaling methods.
(2) We improve the HD prediction accuracy, reduce the false predictions, and compare the default and the best parameters to determine which is better to use.
(3) We identify an appropriate prediction algorithm among the four ML algorithms in order to classify the given University of California Irvine (UCI) HD dataset accurately.
(4) We validate the model performance using metrics such as accuracy, F1 score, precision, recall, and the receiver operating characteristic (ROC) curve. The experimental results show the superiority of the proposed model.

The organization of the remaining parts of the work is as follows. Section 2 discusses related works. Section 3 explains the methodologies and the proposed system model. Section 4 carries out the analysis of the experimental results. Finally, Section 5 concludes the work and shows the possibility of the future work.

2. Related Works

Most of the works consulted on HD prediction presented their outcomes in terms of accuracy according to the ML algorithms used. Hence, the performance differs from study to study due to the different ML algorithms. For example, Tu et al. [17] obtained an accuracy of 81.14% using bagging and 78.9% using a decision tree (DT). An accuracy of 84.14% was reached when Srinivas et al. [18] applied a naive approach and were able to correctly identify patients with HD. Classification and regression trees (CART) and DT were used by Chaurasia and Pal [19], and their experiment showed an accuracy of 83.49% for CART and 82.50% for DT. Hari Ganesh and Gajenthiran [20] applied an NB approach and obtained an accuracy of 83.40%. SVM (linear and sigmoid) was used by Takçi [10], achieving accuracies of 84.81% and 84.44% when identifying HD patients. In addition, other works used standard ML algorithms to predict HD, notably SVM with an accuracy of 84.85%, LR with 82.56%, and DT with 82.22% in [9], LR with 85.86% and Vote with 87.4% in [16], SVM with 85% in [21], 84.12% in [22], and 83.6%, and KNN with an accuracy of 78% in [23]. Kavitha et al. [24] proposed a novel ML approach, and experimental results show an accuracy level of 88.7% with their hybrid model. Malavika et al. tested and compared various ML methods to predict HD, where LR obtained an accuracy of 86.88% [25].

Data scaling is one of the important techniques used to scale and normalize data. Several techniques for data scaling are available; however, choosing appropriate scaling methods for ML algorithms is a challenging task. Some works examine the influence of data scaling practices on different ML algorithms. Ahsan et al. [15] tested different ML algorithms before and after applying scaling methods, where LR obtained 84% accuracy. Shahriyari [26] showed that normalization significantly affected the performance of different ML algorithms: SVM had a maximum accuracy of 78%, compared to the other supervised algorithms, while NB had the best precision and the lowest runtime. Another work, conducted by Ambarwari et al. [27], showed that data scaling methods such as normalization and min–max normalization also significantly affect data analysis. Balabaeva and Kovalchuk [28] examined the effect of different scaling methods on datasets from patients with heart failure. Various scaling methods (SS, MS, max-abs scaler, RS, and quantile transformer) were used with LR, random forest, DT, and XGBoost.

The train/test split ratio could affect the outcome of the models and thus the prediction performance itself. However, only a few works considered the split ratio; for example, Rácz et al. [29] evaluated the effect of the train/test split ratios on the ML algorithm’s performance. They tested and compared several combinations of dataset sizes and split ratios. They applied them to five different ML algorithms to find the dissimilarities or resemblances. They noticed clearly that the distinctions depended not only on the applied ML algorithms but also on the dataset sizes, which may extend to the train/test split ratios.

From the abovementioned introduction and related works, one can notice that work on the simultaneous use of ML algorithms, scaling methods, split ratios, and parameter tuning has rarely been available in the literature. Although some works utilize data scaling methods and others consider split ratios, research on the combined impact of, and interaction between, the scaling methods, the train/test split ratios, and the best parameters is still missing in the current literature. This work considers the combined impact of the split ratio, parameter tuning, and scaling methods on four ML algorithms (i.e., KNN, LR, NB, and SVM) for HD prediction.

3. The Proposed System Model

3.1. Dataset Description

In this work, we consider one of the frequently used UCI HD datasets, which can be found on Kaggle [30]. This dataset contains 14 attributes, including one target attribute, and has 303 instances. The details of the dataset are presented in Table 1; it comprises six numerical and eight categorical attributes.

According to the attributes, patients aged between 29 and 79 have been selected in this dataset and are considered for the experimental study. The gender value 1 has been assigned to male patients and 0 to female patients. There are four types of chest pain indicating HD. The first, named typical angina, is caused by reduced blood flow to the heart muscles because of narrowed coronary arteries. The second is called atypical angina, a chest pain that occurs during mental or emotional stress. The third, nonanginal chest pain, may have various causes. The last type is the asymptomatic type, which may not be a symptom of HD. Trestbps records the resting blood pressure. Chol informs about the cholesterol level. Fbs is the fasting blood sugar level; its assigned value is 1 if the fasting blood sugar is above 120 mg/dl and 0 if it is below. Restecg is the resting electrocardiographic result, while Thalach is the maximum heart rate achieved. Exang indicates exercise-induced angina. Oldpeak informs about ST depression. Slope informs about the slope of the peak exercise ST segment. Ca gives the number of major vessels colored by fluoroscopy. Thal indicates the thalassemia status, and Num is the target attribute. The presence or absence of HD is the target attribute, which comprises binary values: the value 0 represents normal, and 1 signifies that the patient is confirmed with HD.

3.2. Data Preprocessing

The dataset used in this work, a medical dataset taken from Kaggle, does not have any null values. It has both nominal and numerical attributes.

The attribute values of Age, Trestbps, Chol, Fbs, Thalach, and Oldpeak are numeric and do not need any encoding. Sex (with 2 distinct attribute values), Cp (4), Restecg (3), Exang (2), Slope (3), Ca (4), Thal (4), and Num (2) are nominal, and these nominal features have to be encoded. Nominal and numeric attributes are shown in Table 1. For this work, we used 14 variables from the Cleveland UCI HD dataset. The most important preprocessing consideration was to encode the nominal features, which was done using a one-hot encoding function.
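As a minimal sketch of this encoding step (assuming pandas and the Kaggle copy of the dataset; the file name and lowercase column names are assumptions, not stated in the paper), the nominal columns can be one-hot encoded as follows:

```python
import pandas as pd

# Hypothetical file/column names for the Kaggle UCI HD dataset.
df = pd.read_csv("heart.csv")

# Nominal attributes from Table 1 (the target column is left unencoded).
nominal_features = ["sex", "cp", "restecg", "exang", "slope", "ca", "thal"]

# One-hot encode only the nominal columns; the numeric columns
# (age, trestbps, chol, fbs, thalach, oldpeak) pass through unchanged.
df_encoded = pd.get_dummies(df, columns=nominal_features)
```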

The data scaling of the numerical features is then handled, as various ML algorithms require data scaling methods to produce the best results. We used three scaling methods. The first is SS, which standardizes features to zero mean and unit standard deviation, making the distribution standardized. The second is MS, which scales features into the range [0, 1]; if there are negative values in the dataset, the range becomes [−1, 1]. During this scaling, the shape of the original distribution is not changed. The third is RS, which reduces the influence of outliers by centering on the median and scaling according to the quantile (interquartile) range, after which SS or MS can be applied if needed.
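A minimal sketch of the three scalers, assuming scikit-learn (the toy feature rows below are placeholders, not dataset values); fitting on the training split only avoids leaking test statistics:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, MinMaxScaler, RobustScaler

# Toy numeric rows (age, trestbps, chol) standing in for the real splits.
X_train = np.array([[63.0, 145.0, 233.0],
                    [37.0, 130.0, 250.0],
                    [56.0, 120.0, 236.0]])
X_test = np.array([[57.0, 140.0, 192.0]])

scalers = {
    "SS": StandardScaler(),                           # zero mean, unit variance
    "MS": MinMaxScaler(feature_range=(0, 1)),         # rescale to [0, 1]
    "RS": RobustScaler(quantile_range=(25.0, 75.0)),  # median/IQR based
}

for name, scaler in scalers.items():
    # Fit on training data only, then transform both splits.
    X_train_s = scaler.fit_transform(X_train)
    X_test_s = scaler.transform(X_test)
    print(name, X_train_s[0], X_test_s[0])
```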

3.3. ML Algorithms

In the medical domain, one of the essential tasks of ML is to predict the correct classification. Usually, trained physicians and other medical professionals perform this interpretation. However, with the massive amount of data generated every day, it is very challenging to carry out the classification of data manually. Different ML algorithms can support physicians in clinical applications. They can reduce the workload of medical experts and clinicians in the procedures of prediction, diagnosis, or prognosis. Various ML algorithms can cope with prediction tasks, among many others. The following subsections provide a comprehensive understanding of some of these ML algorithms.

3.3.1. KNN

KNN is a supervised ML algorithm that is used to classify labeled datasets. It classifies objects according to their nearest neighbors. KNN calculates the distance of an attribute from its neighbors. When the data points are continuous attributes, this algorithm employs the Euclidean distance (ED) given by equation (1) or the Manhattan distance (MD) defined in equation (2) for measuring the distance between the data points. The Hamming distance (HD) defined in equation (3) can be used when the attributes are categorical:

$$ED(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}, \qquad (1)$$

$$MD(x, y) = \sum_{i=1}^{n} |x_i - y_i|, \qquad (2)$$

$$HD(x, y) = \sum_{i=1}^{n} \mathbf{1}(x_i \neq y_i). \qquad (3)$$

KNN considers all features to be of equal weight. It is a technique of choice for a classification problem when the dataset does not have numerous variables, which is the case for the dataset used in this work. In case there are too many features, the best approach is to perform feature selection.
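A short sketch of equations (1)–(3) in Python (the patient vectors are toy values for illustration):

```python
import numpy as np

def euclidean(x, y):
    # Equation (1): square root of the summed squared differences.
    return np.sqrt(np.sum((x - y) ** 2))

def manhattan(x, y):
    # Equation (2): sum of the absolute differences.
    return np.sum(np.abs(x - y))

def hamming(x, y):
    # Equation (3): number of positions where categorical values differ.
    return np.sum(x != y)

a = np.array([63.0, 145.0, 233.0])  # toy patient records
b = np.array([37.0, 130.0, 250.0])
print(euclidean(a, b), manhattan(a, b))
print(hamming(np.array(["male", "typical"]), np.array(["female", "typical"])))
```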

3.3.2. LR

LR is a supervised ML algorithm that allows a mapping between a response variable and predictor variables. LR establishes relationships between variables and predicts a particular outcome. Assume that LR is ready to predict the probability $P(y = 1 \mid x)$; the mathematical formula is as follows:

$$P(y = 1 \mid x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_1 + \cdots + \beta_n x_n)}}, \qquad (4)$$

where $\beta_0$ and $\beta_1, \ldots, \beta_n$ are coefficients learned from the training set and $x_1, \ldots, x_n$ are the input variables.
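For illustration, equation (4) can be computed directly; the coefficients below are made up, not learned from the HD dataset:

```python
import numpy as np

def predict_proba(x, beta0, beta):
    # Equation (4): sigmoid of the linear combination of the inputs.
    z = beta0 + np.dot(beta, x)
    return 1.0 / (1.0 + np.exp(-z))

# Toy coefficients, purely illustrative.
p = predict_proba(np.array([0.5, -1.2]), beta0=0.1, beta=np.array([0.8, 0.3]))
print(p)  # predicted probability that the patient has HD (y = 1)
```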

3.3.3. NB

NB is a supervised ML algorithm that cooperates with Bayes' theorem. It is a probability-oriented classifier. It works by assuming that all variables in the dataset are conditionally independent. This means that there is no correlation between the variables, and it is often useful for a sparse dataset. Bayes' theorem is given by

$$P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)}, \qquad (5)$$

where $P$ stands for probability. $P(A \mid B)$ is calculated for a given element $B$ and gives the probability of occurrence of element $A$, where $P(A)$ is the probability of occurrence of element $A$, $P(B)$ is the probability of occurrence of element $B$, and $P(B \mid A)$ is the conditional probability of element $B$ given that element $A$ occurs; this theorem is used to perform the classification. For independent features, the theorem reduces to a direct multiplication of the probabilities of each feature occurring.
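A toy numeric reading of equation (5), with made-up probabilities for illustration only:

```python
# A = "patient has HD", B = "a diagnostic feature is present".
p_a = 0.3          # P(A): prior probability of HD (assumed)
p_b_given_a = 0.9  # P(B|A): feature present given HD (assumed)
p_b = 0.4          # P(B): overall probability of the feature (assumed)

p_a_given_b = p_b_given_a * p_a / p_b  # equation (5)
print(p_a_given_b)  # 0.675: posterior probability of HD given the feature
```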

3.3.4. SVM

SVM is a supervised ML algorithm that is used for binary classification problems in various fields, especially the medical field. In SVM, the decision boundary is computed from the support vectors, the data points closest to the boundary, instead of from the distance to every data point. This set of support vectors is a good tool for classifying a given dataset.
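A minimal sketch, assuming scikit-learn and synthetic data standing in for the HD features:

```python
from sklearn.datasets import make_classification
from sklearn.svm import SVC

# Toy binary classification data (not the HD dataset).
X, y = make_classification(n_samples=100, n_features=5, random_state=5)

clf = SVC(kernel="linear")  # linear decision boundary
clf.fit(X, y)

# The support vectors are the training points that define the boundary.
print(clf.support_vectors_.shape)
```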

3.4. The Proposed MLbPM

A model named MLbPM is proposed, as given in Figure 1 and Algorithm 1. It utilizes LR, the split ratios, the best parameters, and the scaling methods. Let us consider the dataset $D$, which contains feature variables, classifications, and a number of patient records. Let us consider a classification problem where a sample $x_i$ is assigned to one of the potential classification classes ($C_0$, $C_1$) in the applied dataset. In our work, the dataset outcome is divided into two groups: normal or healthy people and abnormal patients. Let us name the class $C_0$ for normal (value 0) and the class $C_1$ for abnormal (value 1).

1: Load the HD dataset D
2: Identify the total number of patient records PR in D
3: Find out the nominal features NF in D
4: for each record in PR do
5:   for each NF do
6:     Apply one-hot encoding on each NF
7:   end for
8:   Return D′
9: end for
10: Partition D′ into two sets: training TR and testing TS data
11: Select the base classifier algorithms CA
12: for each CA do
13:   Consider split ratios 80 : 20 SR1 and 70 : 30 SR2
14:   Predict the result of CA on TR with SR1
15:   Predict the result of CA on TR with SR2
16:   Tune the hyperparameter of CA and go to step 12
17: end for
18: for each CA do
19:   Evaluate the performance of CA on TS
20:   Predict the result of CA on TS, save the results
21: end for
22: Get D′ in step 8 and go to step 23
23: Identify the numeric features NC in D′
24: for each NC do
25:   Identify the scaling methods SM
26:   Scale each NC with an SM
27:   Return D″
28: end for
29: Execute steps 10 to 17 for D″, then go to step 30
30: for each CA do
31:   Evaluate the performance of each classifier on TS
32:   Predict the result of each classifier on TS, save the results
33: end for
34: Select and validate the best model
34: Select and validate the best model

After loading the HD dataset, we preprocess our data and split the dataset into two subsets, one for training data and another for testing data. The formula followed for splitting our dataset is

$$D = TR + TS, \qquad (6)$$

where $D$ is the whole dataset, $TR$ the training data, and $TS$ the testing data.

It is normal for the training data to be larger than the testing data, and the sum of the training data and the testing data makes up the whole dataset considered for an experiment, as given in equation (6). Our dataset is divided according to two split ratios, SR1 and SR2, known as 80 : 20 and 70 : 30, respectively, where 80 and 70 denote the training portions while 20 and 30 represent the testing portions. We also use the scaling methods SS, MS, and RS, and four base classifiers are trained on the training samples. The hyperparameters of the base classifiers are tuned to improve the predictive accuracy, and the trained model is tested for evaluation.
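A sketch of the split step, assuming scikit-learn, the `df_encoded` frame from the earlier preprocessing sketch, and a label column named "target" (the Kaggle copy's name for the Num attribute; both names are assumptions):

```python
from sklearn.model_selection import train_test_split

X = df_encoded.drop(columns=["target"])
y = df_encoded["target"]

# SR1 = 80:20 -> test_size=0.2; SR2 = 70:30 -> test_size=0.3.
# Section 4 reports a random state of five.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=5)
```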

The experiments follow two options: one without scaling methods and another with scaling methods. For the first option, on the one hand, the 80 : 20 split ratio is considered with the default or best parameters for each ML algorithm (KNN, LR, NB, and SVM). These ML algorithms serve as base classifiers for training, and the created models are used for testing. The parameters of KNN, LR, NB, and SVM are tuned using the hyperparameter optimization technique grid search cross-validation in order to obtain the best parameters. On the other hand, the 70 : 30 split ratio is used by repeating the same procedures as applied to the 80 : 20 split ratio. The LR model performs best whichever split ratio and parameter setting are considered, and the combination of LR, the 70 : 30 split ratio, and the best parameters produces good accuracy.
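A sketch of the grid-search step for LR; the exact search grid is not stated in the paper, so the candidate values below are assumptions chosen to include the reported best parameters:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

param_grid = {
    "C": [0.0001, 0.001, 0.01, 0.1, 1.0],
    # Note: scikit-learn >= 1.2 expects penalty=None instead of "none".
    "penalty": ["l2", "none"],
    "solver": ["lbfgs", "sag"],
    "max_iter": [100, 1000, 5000],
}

search = GridSearchCV(LogisticRegression(), param_grid,
                      scoring="accuracy", cv=5)
search.fit(X_train, y_train)
print(search.best_params_, search.best_score_)
```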

The other option concerns the data scaling methods: SS, MS, and RS are applied to the input attributes. Each scaling method is used with each split ratio and the default or best parameters. The same classifiers as before are tested, and LR is consistently the most effective classifier for both split ratios when the best parameters and RS are considered. The combination of LR, the 70 : 30 split ratio, the best parameters, and RS makes the proposed MLbPM more accurate than the standard ML algorithms.

As shown in Algorithm 1, first, the dataset D is loaded, and then all nominal features (denoted by NF) are identified. After that, each nominal feature is encoded (lines 1 to 9), and D′ is divided into two sets, one for training and another for testing (line 10). The base classifiers are selected (line 11) for classifying HD. Subsequently, predictions are made according to each classifier and a chosen split ratio (lines 12 to 15). After that, the hyperparameter is tuned to improve the result of each base classifier (line 16). The performance evaluation is executed, and the results are saved (line 20). In addition, the numeric features are identified and scaled using the scaling methods (lines 23 to 28). After that, the previous actions are repeated from lines 10 to 17 (line 29). Subsequently, all classifiers' performances are evaluated (line 31), and the results are saved again (line 32). Finally, after comparing the results of all classifiers, the best model is selected and validated (line 34).

4. Experimental Results

Experimental work with various classification algorithms on the Cleveland UCI HD dataset has been done. Some algorithms showed good accuracy, whereas others performed poorly. This work used three scaling methods (SS, MS, and RS) to improve the performance of four different ML algorithms (LR, KNN, NB, and SVM). A train/test split function with a random state of five is applied to divide the dataset into training and testing data. The split ratios of 80 : 20 and 70 : 30 are used for each experiment.

4.1. Metrics Used

Distinguished performance indices are considered to validate the integrity of our research results: accuracy, F1 score, precision, and recall. These indices are calculated using the true positive (TP), true negative (TN), false positive (FP), and false negative (FN) counts. The performance metrics are given as follows:

(1) Accuracy: a metric for evaluating classification models. It is the fraction of predictions that the model got right, as shown in equation (7), and helps to measure how well the model works:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}. \qquad (7)$$

(2) F1 score: a weighted average of precision and recall that conveys the balance between the two, as given in equation (8):

$$F_1 = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}. \qquad (8)$$

(3) Precision: a metric that quantifies the number of correct positive predictions, as in equation (9). It measures how many of the samples predicted as positive are actually positive:

$$\text{Precision} = \frac{TP}{TP + FP}. \qquad (9)$$

(4) Recall: a metric that quantifies the number of correct positive predictions made out of all positive predictions that could have been made. It measures how many of the positive samples are captured by the positive predictions, as shown in equation (10):

$$\text{Recall} = \frac{TP}{TP + FN}. \qquad (10)$$

TP is known as true positive and represents the number of persons correctly predicted with HDs. TN signifies true negative and represents the persons tested accurately as negative. FP is false positive and refers to those persons that are wrongly tested as positive. FN is known as false negative and indicates the persons incorrectly predicted as negative.
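For illustration, equations (7)–(10) map directly onto scikit-learn's metric functions; the label vectors below are toy values standing in for a classifier's test-set output:

```python
from sklearn.metrics import (accuracy_score, f1_score,
                             precision_score, recall_score)

y_true = [1, 0, 1, 1, 0, 0, 1, 0]  # toy ground-truth labels
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]  # toy predictions

print(accuracy_score(y_true, y_pred))   # (TP + TN) / (TP + TN + FP + FN)
print(precision_score(y_true, y_pred))  # TP / (TP + FP)
print(recall_score(y_true, y_pred))     # TP / (TP + FN)
print(f1_score(y_true, y_pred))         # 2PR / (P + R)
```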

4.2. The Impact of Split Ratio and Best Parameter on ML Algorithms

Table 2 presents the overall accuracy, F1 score, precision, and recall for all four ML algorithms used, without scaling methods. These algorithms are used with two parameter settings: the default parameters and the best parameters. The results presented in Table 2 are obtained with the split ratio of 70 : 30 and the best parameters, which provided good results.

With the split ratio of 70 : 30 and the best parameters, LR shows the highest accuracy of 94.2%, compared to the other three ML algorithms. LR uses the best parameters C, max_iter, penalty, and solver with 0.0001, 5000, none, and sag as values, respectively. On the other hand, KNN reveals the lowest performance with 64.83% accuracy. LR also produces the highest F1 score of 94.67%, while KNN has the lowest F1 score of 68.62%.

With the same split ratio and best parameters as above, LR shows the highest precision of 95% for class 0 and 92% for class 1. On the other hand, KNN gives the lowest precision of 65% for class 0 and 65% for class 1. LR offers the highest recall, 91% for class 0 and 96% for class 1, while KNN has the lowest recall, 56% for class 0 and 73% for class 1. To further validate our results, we applied a ROC curve, as demonstrated in Figure 2. LR performs better than KNN, NB, and SVM.

4.3. The Impact of the Split Ratio on LR

Table 3 presents the impact of the split ratio on the model's results. Practically, LR gained 2.4% additional accuracy when used with the 70 : 30 split ratio and the best parameters compared to the 80 : 20 split ratio. With 80 : 20 as the split ratio, LR obtained an accuracy of 91.8%, an F1 score of 92.06%, a precision of 93% for class 0 and 91% for class 1, and a recall of 90% for class 0 and 94% for class 1. However, LR obtains an accuracy of 94.2%, an F1 score of 94.67%, a precision of 95% for class 0 and 92% for class 1, and a recall of 91% for class 0 and 96% for class 1 when used with the 70 : 30 split ratio. The 70 : 30 split ratio thus provides better results than the 80 : 20 split ratio.

A ROC curve is used to validate the performance of the obtained results according to the 70 : 30 and 80 : 20 split ratios. Considering the best-performing LR, the 70 : 30 split ratio provides better results than 80 : 20, as shown in Figure 3.

4.4. The Impact of the Best Parameter on LR

Table 4 presents the effect of the best parameters on the results. When LR is used with the default parameters, the accuracy is 93.4%, the F1 score 93.75%, the precision 93% for class 0 and 94% for class 1, and the recall 93% for class 0 and 94% for class 1. However, LR obtains an accuracy of 94.2%, an F1 score of 94.67%, a precision of 95% for class 0 and 92% for class 1, and a recall of 91% for class 0 and 96% for class 1 when used with the best parameters.

Considering LR with the default and the best parameters, we applied a ROC curve to validate the results further. LR with the best parameters maintains better performance than with the default parameters, as shown in Figure 4.

4.5. Impact of the Scaling Methods

Table 5 presents the accuracy, F1 score, precision, and recall for LR, as the most effective algorithm, with the best parameters, the 70 : 30 split ratio, and the scaling methods SS, MS, and RS. With the scaling methods, the LR algorithm shows the best results. It shows the highest accuracy of 96.7% when applied with RS, using C, max_iter, penalty, and solver as the best parameters with 0.08858668, 100, l2, and lbfgs as values, respectively. In terms of F1 score, LR obtains 96.75% and shows the highest precision of 95% for class 0 and 96% for class 1 with RS. LR has the highest recall of 93% for class 0 and 94% for class 1 when used with RS.
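A sketch of this best-performing configuration (RS followed by LR with the best parameters listed above), assuming scikit-learn and the 70 : 30 split from the earlier sketch:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import RobustScaler

# RobustScaler + LR with the best parameters reported in Table 5.
model = make_pipeline(
    RobustScaler(),
    LogisticRegression(C=0.08858668, max_iter=100,
                       penalty="l2", solver="lbfgs"),
)
model.fit(X_train, y_train)
print(model.score(X_test, y_test))  # test-set accuracy
```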

As can be seen in Table 5, the accuracy obtained before scaling and after scaling (using RS) is 94.2% and 96.7%, respectively, which shows that the accuracy improves by up to 2.5% after applying the scaling methods. Thus, the scaling methods have a great impact on the performance of the proposed model.

To further validate our model's results, we applied the ROC curve as demonstrated in Figure 5; our findings show that the model's results after applying the scaling methods are better than before scaling.

4.6. Comparison with Other Studies

As discussed in the related works section, Amin et al. [16] and Bashir et al. [9] tested standard ML algorithms without applying scaling methods. In contrast, Balabaeva and Kovalchuk [28] and Ahsan et al. [15] tested and evaluated standard ML algorithms before and after scaling. None of them mention default and best parameters, but they all used 80 : 20 as the split ratio. To compare our results with those available in the related works, we used only the 80 : 20 split ratio, without and with scaling methods, for a fair comparison, because the other authors do not use the 70 : 30 split ratio. The comparison results are shown in Table 6: column 2 presents the accuracy of the proposed method compared with other works and clarifies that our proposed MLbPM based on LR outperforms the models proposed in [9, 15, 16, 28] when scaling methods are not considered. In column 3, the results of Amin et al. and Bashir et al. are missing because they did not use scaling methods. As shown in Table 6, our model's results remain better than those in [15, 28] after applying SS, MS, and RS as scaling methods on LR.

To further demonstrate the effectiveness of the proposed MLbPM, we present the accuracy comparison with other works in Figures 6 and 7. Figure 6 presents the accuracy of MLbPM and the compared works before scaling, while Figure 7 shows the accuracy comparison after scaling. Figures 6 and 7 show that our proposed model performs better than all the compared works and provides more promising results both before and after scaling.

4.7. Discussion

In this work, the performance of four different ML algorithms (i.e., KNN, LR, NB, and SVM) was analyzed by considering the split ratios, the parameter tuning, and the data scaling approaches. We evaluated the abovementioned ML algorithms with the 80 : 20 and 70 : 30 split ratios, the default and best parameters, and different scaling methods such as SS, MS, and RS. As a result, we noticed that, among the four mentioned ML algorithms, LR provides the best results.

According to the results, the 70 : 30 split ratio gave the best result, as shown in Table 3. By testing the default and best parameters for LR, we concluded that the best parameters provide better results than the default parameters, as given in Table 4.

We evaluated the proposed MLbPM model before and after scaling and concluded that, after scaling, the proposed model provides better results, achieving an accuracy of 96.7% as given in Table 5. Among the scaling methods tested, RS is the best and has a great impact on the results, as shown in Tables 5 and 6.

5. Conclusion

This work evaluated four ML algorithms (i.e., KNN, LR, NB, and SVM) and three distinct data scaling methods to detect patients with HD using the UCI HD dataset. We proposed a model called the ML-based prediction model (MLbPM), which combines the split ratio, the best parameters, the data scaling methods, and the ML algorithms. The proposed MLbPM was evaluated by performing various experiments using the UCI HD dataset. According to the results, the split ratio, the parameter tuning, and the data scaling methods influence the algorithms' performance. The experimental results show that the proposed MLbPM achieves a better accuracy of 96.7% by using the LR algorithm with a 70 : 30 split ratio, the best parameters, and the robust scaler. In addition, as can be seen in the experimental results, MLbPM outperforms other compared works in terms of accuracy. We believe that this proposed MLbPM will guide researchers and practitioners performing HD-related medical tasks. In the future, we plan to design fast and low-complexity algorithms based on artificial intelligence, keeping in view the demand for real-time HD prediction in smart e-health care systems.

Data Availability

The datasets generated and analyzed during the current study are available from the corresponding authors upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was funded by the Hunan Key Laboratory for Internet of Things in Electricity (Grant No. 2019TP1016), the National Natural Science Foundation of China (Grant No. 72061147004), the National Natural Science Foundation of Hunan Province (Grant No. 2021JJ30055), and the project about research on key technologies of power knowledge graph (Grant No. 5216A6200037) and was also supported by the EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia.