Research Article

Preprocessing Approach for Power Transformer Maintenance Data Mining Based on k-Nearest Neighbor Completion and Principal Component Analysis

Table 4

Statistical data after imputations.

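| Variable | Obs | Obs. with MV | Obs. without MV | KNN imputation (k = 5): Mean | KNN imputation (k = 5): Standard deviation | Mean imputation: Mean | Mean imputation: Standard deviation | Multiple imputation: Mean | Multiple imputation: Standard deviation |
|---|---|---|---|---|---|---|---|---|---|
| H2 | 31 | 0 | 31 | 80,323 | 18,734 | 80,034 | 18,665 | 79,064 | 19,062 |
| CH4 | 31 | 0 | 31 | 30,374 | 10,074 | 30,387 | 10,074 | 30,291 | 10,088 |
| C2H4 | 31 | 0 | 31 | 30,819 | 10,832 | 31,080 | 10,734 | 30,772 | 10,870 |
| CO | 31 | 0 | 31 | 1174,903 | 293,062 | 1174,400 | 293,049 | 1184,598 | 298,499 |
| CO2 | 31 | 0 | 31 | 9045,484 | 3072,597 | 9159,467 | 3006,342 | 9173,076 | 3007,297 |
| O2 | 31 | 0 | 31 | 5591,968 | 5126,700 | 5591,968 | 5126,70 | 5591,968 | 5126,700 |
| N2 | 31 | 0 | 31 | 71286,032 | 10399,787 | 71206,679 | 10264,423 | 71291,446 | 10547,466 |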

Obs: observations; MV: missing value. Values are written with a comma as the decimal separator.
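To illustrate how the per-variable means and standard deviations in Table 4 can differ between imputation methods, the following is a minimal sketch in plain Python that compares mean imputation with a simple k-nearest-neighbor imputation. It is not the authors' implementation: the distance metric (Euclidean over the columns observed in both rows, rescaled for the number of shared columns), the helper names `mean_impute`, `knn_impute`, and `column_summary`, and the toy data are all assumptions made for illustration. The toy values are invented and are not the transformer measurements; multiple imputation is not covered.

```python
import math
import statistics


def mean_impute(rows):
    """Replace each None with the mean of the observed values in its column."""
    col_means = [
        statistics.mean(v for v in col if v is not None) for col in zip(*rows)
    ]
    return [
        [col_means[j] if v is None else v for j, v in enumerate(row)]
        for row in rows
    ]


def knn_impute(rows, k=5):
    """Replace each None with the mean of that column over the k nearest rows.

    Assumed distance: Euclidean over columns observed in both rows,
    rescaled by (total columns / shared columns).
    """
    n_cols = len(rows[0])

    def distance(a, b):
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        squared = sum((x - y) ** 2 for x, y in shared)
        return math.sqrt(squared * n_cols / len(shared))

    completed = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate donors: other rows that observe column j.
            donors = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                d = distance(row, other)
                if math.isfinite(d):
                    donors.append((d, other[j]))
            donors.sort()
            completed[i][j] = statistics.mean(val for _, val in donors[:k])
    return completed


def column_summary(rows):
    """Return (mean, sample standard deviation) for each column."""
    return [(statistics.mean(col), statistics.stdev(col)) for col in zip(*rows)]


# Invented toy data: two variables, one missing value (None) in the third row.
toy = [
    [1.0, 10.0],
    [2.0, 20.0],
    [3.0, None],
    [4.0, 40.0],
    [5.0, 50.0],
    [100.0, 1000.0],
]

by_mean = mean_impute(toy)
by_knn = knn_impute(toy, k=2)  # k = 2 only because the toy set is tiny

print("mean imputation:", column_summary(by_mean))
print("KNN imputation: ", column_summary(by_knn))
```

In this toy case the outlying last row pulls the column mean upward, so mean imputation fills the gap with a value far from its neighbors, while the KNN variant uses only the two closest rows. The same mechanism explains why the summary statistics of a variable can shift slightly depending on the imputation method chosen.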