Review Article

Modern Machine-Learning Predictive Models for Diagnosing Infectious Diseases

Table 3

Dataset specifications for the reviewed articles.

ReferenceSizeData acquisition methodInfectious disease

[19]Time-series of physiological dataWearable sensors to collect multivariate physiological dataTetanus and HFMD
[20]60 individualsMedical sensors, hub, and Android-based app to collect vital signsSkin and soft tissue infection, urinary tract infection, and acute respiratory infection
[21]49,721 usersApp-based symptom trackerSARS-CoV-2 (COVID-19)
[22]88 individualsCellular phone voice recordingsSARS-CoV-2 (COVID-19)
[23]37,599 tweetsSocial media messagesLatent infectious diseases
[24]1,317,018 classes, 7,731,914 axioms, and 1,269,340 inheritance relationsMultiple medical ontologies507 infectious diseases
[25]31268 reportsNLP tool (Topaz) to extract influenza-related findingsInfluenza
[26]52,306 patientsRoutine blood testsSARS-CoV-2 (COVID-19)
[27]1391 patientsRoutine laboratory resultsSARS-CoV-2 (COVID-19)
[28]295 patientsClinical recordsCandidemia
[29]1118 patientsEHRCDI
[30]152 patientsClinical records and chest CT imagesSARS-CoV-2 (COVID-19)
[31]2482 imagesCT scan imagesSARS-CoV-2 (COVID-19)
[32]56,081 patientsHistorical and real-time data25 infectious diseases

SARS-CoV-2: severe acute respiratory syndrome coronavirus 2; EHR: electronic health record; HFMD: hand, foot, and mouth disease; CDI: Clostridium (Clostridioides) difficile infection; CT: computed tomography.