Research Article

Developing Machine Learning and Statistical Tools to Evaluate the Accessibility of Public Health Advice on Infectious Diseases among Vulnerable People

Table 3

Performance of Gaussian Naïve Bayes (GNB) classifiers with different feature sets.

Model  Techniques                                  Training (5-fold CV)  Testing
                                                   AUC mean (SD)         AUC    Accuracy  Macro F1  Sensitivity  Specificity

1      MLS + POS full (69)                         0.971 (0.0212)        0.940  0.921     0.992     0.944        0.8824
2      MLS + POS jointly optimised (6)             0.998 (0.0026)        0.993  0.940     0.943     0.963        0.9118
3      MLS full (26 features)                      0.997 (0.003)         1.0    0.966     0.963     1.0          0.9118
4      MLS optimised (2 features)                  0.998 (0.004)         1.0    0.943     0.938     1.0          0.8529
5      POS full (46 features)                      0.959 (0.0238)        0.907  0.852     0.843     0.889        0.7941
6      POS optimised (8 features)                  0.982 (0.0166)        0.968  0.955     0.951     1.0          0.8824
7      MLS + POS separately optimised (10)         1.0 (0)               1.0    0.955     0.951     1.0          0.8824
8      Refined MLS + POS separately optimised (2)  0.995 (0.0079)        0.999  0.989     0.988     0.982        1.0
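
For readers less familiar with the testing metrics reported in the table, the following minimal sketch shows how accuracy, macro F1, sensitivity and specificity are conventionally derived from a binary confusion matrix. It is an illustration only, not the authors' evaluation code: the function name binary_metrics and the counts in the usage lines are hypothetical and are not taken from the study data. The sketch assumes both classes are present, so that no denominator is zero.

```python
def binary_metrics(tp, fn, tn, fp):
    """Derive common test metrics from binary confusion-matrix counts.

    tp/fn: positive-class cases predicted correctly/incorrectly.
    tn/fp: negative-class cases predicted correctly/incorrectly.
    Assumes each class has at least one case (no zero denominators).
    """
    sensitivity = tp / (tp + fn)  # recall of the positive class
    specificity = tn / (tn + fp)  # recall of the negative class
    accuracy = (tp + tn) / (tp + fn + tn + fp)

    # Per-class F1 = 2*TP / (2*TP + FP + FN), treating each class
    # in turn as the "positive" one; macro F1 is their unweighted mean.
    f1_pos = 2 * tp / (2 * tp + fp + fn)
    f1_neg = 2 * tn / (2 * tn + fn + fp)
    macro_f1 = (f1_pos + f1_neg) / 2

    return {
        "accuracy": accuracy,
        "macro_f1": macro_f1,
        "sensitivity": sensitivity,
        "specificity": specificity,
    }


# Hypothetical counts chosen purely for illustration (not study data).
example = binary_metrics(tp=45, fn=5, tn=40, fp=10)
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```

Because macro F1 averages the two per-class F1 scores without weighting by class size, it can differ noticeably from accuracy when the classes are imbalanced or when errors fall mostly on one class.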