Research Article

Modified Password Guessing Methods Based on TarGuess-I

Table 2

Basic information about our datasets.

DatasetWeb serviceWhen leakedTotalWith PIIContentBe used

Tianya (Tinya)Social forum201131,006,59028,158User name, PW, E-mail[12, 14, 38, 39]
Dodonew (Dodon)E-commerce201116,258,89121,854User name, PW, E-mail[12, 14, 17, 39]
Shengda (Senda)Game201115,313,33438,203User name, PW, E-mailNone
12306 (12306)Train ticketing2014232,884232,884PW, E-mail, PII[16, 17]
Aipai (Aipai)Video blogs20167,682,23227,917User name, PW, E-mailNone
Youku (youku)Audio visual201692,547,261134,863PW, E-mailNone

Hereinafter, each data source is referred as the bold shorthand notations in the brackets.