Research Article

Using an Efficient Detection Method to Prevent Personal Data Leakage for Web-Based Smart City Platforms

Table 4

Classification of file name extensions for 50 websites.

| File name extension | No. of total files (A) | No. of files with personal data (B) | Percentage of files with personal data (C = B/A) | No. of repeated personal data (D) | No. of repeated personal data per file (E = D/B) | No. of nonrepeated personal data (F) | Average nonrepeated personal data per file (G = F/B) |
|---|---|---|---|---|---|---|---|
| Word | 3,381 | 2,546 | 75.30% | 25,146 | 9.88 | 8,952 | 3.52 |
| Excel | 513 | 362 | 70.57% | 12,852 | 35.50 | 3,772 | 10.42 |
| PPT | 87 | 66 | 75.86% | 318 | 4.82 | 258 | 3.91 |
| PDF | 9,649 | 7,679 | 79.58% | 201,083 | 26.19 | 37,488 | 4.88 |
| HTML | 64,264 | 51,264 | 79.77% | 1,031,503 | 20.12 | 31,692 | 0.62 |
| Total | 77,894 | 61,917 | | 1,270,902 | | 82,162 | |
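
The derived columns in Table 4 follow from the definitions in its header (C = B/A, E = D/B, G = F/B), and the Total row is the column-wise sum of the raw counts. The following is a minimal sketch for recomputing them, assuming Python; the helper names `derived_columns` and `column_totals` are hypothetical and not from the article, and the counts are transcribed from the table.

```python
# Raw counts transcribed from Table 4, per file name extension:
# (A) total files, (B) files with personal data,
# (D) repeated personal data, (F) nonrepeated personal data.
COUNTS = {
    "Word":  (3381, 2546, 25146, 8952),
    "Excel": (513, 362, 12852, 3772),
    "PPT":   (87, 66, 318, 258),
    "PDF":   (9649, 7679, 201083, 37488),
    "HTML":  (64264, 51264, 1031503, 31692),
}


def derived_columns(a, b, d, f):
    """Return (C, E, G) as defined in the table header (hypothetical helper)."""
    c = b / a * 100  # C = B/A, expressed as a percentage
    e = d / b        # E = D/B, repeated personal data per file
    g = f / b        # G = F/B, nonrepeated personal data per file
    return round(c, 2), round(e, 2), round(g, 2)


def column_totals(counts):
    """Sum each raw-count column (A, B, D, F) across all extensions."""
    return tuple(sum(col) for col in zip(*counts.values()))


for ext, row in COUNTS.items():
    print(ext, derived_columns(*row))
print("Total", column_totals(COUNTS))
```

Ratios are computed per extension only, matching the table, which leaves C, E, and G blank in the Total row.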