Research Article

Big Data Privacy Preservation Using Principal Component Analysis and Random Projection in Healthcare

Table 1

Description of datasets.

Name of datasetNumber of instancesNumber of attributesAttribute description

Cardiovascular disease dataset70K13Id, age, gender, height, weight, ap_hi, ap_lo, cholesterol, gluc, smoke, alco, active, class
Hypothyroid disease dataset720021Age, sex, on thyroxine, query on thyroxine, on antithyroid medication, sick, pregnant, thyroid surgery, I131 treatment, query hypothyroid, query hyperthyroid, lithium, goiter, tumor, hypopituitary, psych TSH measured, TSH, T3 measured, T3, TT4 measured, TT4, T4U measured, T4U, FTI measured, class