Research Article

Supervised Machine Learning Based Noninvasive Prediction of Atrial Flutter Mechanism from P-to-P Interval Variability under Imbalanced Dataset Conditions

Table 5

Comparison in the percentage of the augmented dataset by descriptive statistics on the minority data (focal AFL).

MeanVarianceSkewness
SMOTEModifiedSmoothed-bootstrapSMOTEModifiedSmoothed-bootstrapSMOTEModifiedSmoothed-bootstrap

Difference between statistics of the original dataset and the augmented dataset (in percentage)
100−0.19−0.170.4325.5210.65−4.3618.3419.983.45
200−0.19−0.140.4325.749.69−4.2723.8719.744.69
300−0.18−0.170.4225.6110.59−4.0120.7018.824.43
400−0.20−0.160.4125.9310.59−3.8320.3518.924.85
500−0.19−0.170.4425.959.95−4.2621.5120.114.51
600−0.20−0.150.4125.9310.26−3.7821.5320.014.81
700−0.19−0.170.4225.4510.77−3.9721.0020.144.42
800−0.19−0.170.4125.8510.55−4.0422.8620.805.15

There are three augmentation techniques (Smote, Modified-Smote, and Smoothed-Bootstrap) for each parameter (Mean, Variance, and Skewness). The technique with the minimum difference is indicated in bold font.