Research Article

Developing Machine Learning and Statistical Tools to Evaluate the Accessibility of Public Health Advice on Infectious Diseases among Vulnerable People

Table 1

Mann-Whitney U test.

| MLS features (1–26), POS (27–72) | EHR mean | EHR std. | RHR mean | RHR std. | Asymp. sig. (2-tailed) | Effect size d_Cohen (corrected effect size or Hedges’ g) | Common language effect size CLES | 95% CI | Mann-Whitney U | Wilcoxon W | Z |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Average sentences per paragraph | 0.601 | 0.345 | 3.316 | 1.163 | 0.000 | 2.588 | 0.966 | 2.301 to 2.874 | 52.00 | 4238.00 | −14.669 |
| TTR | 0.461 | 0.119 | 0.623 | 0.079 | 0.000 | 1.828 | 0.902 | 1.568 to 2.088 | 4389.00 | 8575.00 | −10.680 |
| Difficult words | 141.813 | 96.381 | 70.872 | 35.986 | 0.000 | −1.311 | 0.823 | −1.558 to −1.065 | 8769.50 | 70897.50 | −6.657 |
| Low-stroke characters | 641.593 | 444.264 | 284.707 | 140.930 | 0.000 | −1.507 | 0.857 | −1.758 to −1.256 | 8108.00 | 70236.00 | −7.264 |
| Middle-stroke characters | 120.396 | 80.348 | 51.810 | 27.695 | 0.000 | −1.562 | 0.865 | −1.814 to −1.31 | 7357.00 | 69485.00 | −7.955 |
| High-stroke characters | 0.055 | 0.229 | 0.452 | 1.604 | 0.005 | 0.277 | 0.578 | 0.045 to 0.508 | 14144.00 | 18330.00 | −2.815 |
| Average strokes per character | 7.677 | 0.274 | 7.711 | 0.342 | 0.195 | 0.103 | 0.529 | −0.127 to 0.334 | 14604.00 | 18790.00 | −1.297 |
| 2-character words | 256.637 | 176.783 | 114.341 | 56.262 | 0.000 | −1.509 | 0.857 | −1.76 to −1.258 | 8100.50 | 70228.50 | −7.271 |
| 3-character words | 13.000 | 11.824 | 8.298 | 6.372 | 0.008 | −0.603 | 0.665 | −0.837 to −0.369 | 13139.50 | 75267.50 | −2.647 |
| Average words per sentences | 11.592 | 4.963 | 11.893 | 1.962 | 0.018 | 0.106 | 0.530 | −0.125 to 0.336 | 13438.00 | 17624.00 | −2.368 |
| Single sentences | 0.883 | 0.114 | 0.460 | 0.191 | 0.000 | −2.376 | 0.954 | −2.655 to −2.098 | 949.00 | 63077.00 | −13.851 |
| Ratio of noun phrases | 0.267 | 0.104 | 0.415 | 0.198 | 0.000 | 0.810 | 0.717 | 0.573 to 1.046 | 8396.50 | 12582.50 | −6.999 |
| Frequency of noun phrases per 10K | 322.369 | 37.240 | 314.664 | 37.002 | 0.028 | −0.208 | 0.558 | −0.439 to 0.023 | 13620.50 | 75748.50 | −2.200 |
| Average idioms per sentences | 0.001 | 0.004 | 0.012 | 0.029 | 0.004 | 0.424 | 0.618 | 0.192 to 0.656 | 13969.00 | 18155.00 | −2.894 |
| Content words | 392.582 | 265.421 | 163.074 | 80.538 | 0.000 | −1.642 | 0.877 | −1.896 to −1.387 | 6988.50 | 69116.50 | −8.293 |
| Adverbs of negation | 2.758 | 3.067 | 0.935 | 1.228 | 0.000 | −1.032 | 0.767 | −1.272 to −0.792 | 9522.50 | 71650.50 | −6.283 |
| Sentences with complex semantic categories | 25.396 | 22.039 | 7.614 | 4.758 | 0.000 | −1.643 | 0.877 | −1.898 to −1.388 | 6696.50 | 68824.50 | −8.578 |
| Density of content words | 0.828 | 0.026 | 0.815 | 0.031 | 0.000 | −0.433 | 0.620 | −0.665 to −0.2 | 11857.50 | 73985.50 | −3.820 |
| Average logarithmic frequency of content words | 1.738 | 0.169 | 1.337 | 0.183 | 0.000 | −2.225 | 0.942 | −2.498 to −1.952 | 1854.00 | 63982.00 | −13.009 |
| Idioms | 0.077 | 0.268 | 0.224 | 0.510 | 0.009 | 0.312 | 0.587 | 0.081 to 0.544 | 14168.00 | 18354.00 | −2.623 |
| Pronouns | 41.692 | 35.043 | 1.469 | 1.652 | 0.000 | −2.530 | 0.963 | −2.814 to −2.245 | 620.50 | 62748.50 | −14.398 |
| Personal pronouns | 37.868 | 31.838 | 0.705 | 1.139 | 0.000 | −2.577 | 0.966 | −2.864 to −2.291 | 326.00 | 62454.00 | −15.334 |
| Conjunctions | 18.967 | 16.440 | 11.520 | 6.407 | 0.001 | −0.795 | 0.713 | −1.031 to −0.558 | 12507.50 | 74635.50 | −3.228 |
| Positive conjunctions | 16.824 | 14.159 | 9.000 | 5.193 | 0.000 | −0.991 | 0.758 | −1.23 to −0.751 | 10808.00 | 72936.00 | −4.795 |
| Negative conjunctions | 0.846 | 1.584 | 1.440 | 1.393 | 0.000 | 0.414 | 0.615 | 0.182 to 0.646 | 10710.50 | 14896.50 | −5.070 |
| Difficult words ratio | 30.384 | 5.724 | 35.847 | 8.175 | 0.000 | 0.706 | 0.691 | 0.471 to 0.941 | 9400.50 | 13586.50 | −6.077 |
| A | 3.099 | 3.774 | 3.026 | 2.969 | 0.236 | −0.023 | 0.507 | −0.254 to 0.207 | 14740.50 | 18926.50 | −1.184 |
| VI | 0.143 | 0.485 | 0.159 | 0.424 | 0.348 | 0.037 | 0.510 | −0.194 to 0.267 | 15418.50 | 19604.50 | −0.938 |
| Dk | 0.011 | 0.105 | 0.017 | 0.130 | 0.680 | 0.048 | 0.514 | −0.183 to 0.278 | 15919.00 | 20105.00 | −0.413 |
| VG | 2.154 | 2.454 | 2.054 | 1.919 | 0.505 | −0.049 | 0.514 | −0.28 to 0.181 | 15305.00 | 19491.00 | −0.666 |
| Nv | 9.967 | 10.390 | 9.290 | 6.711 | 0.083 | −0.089 | 0.525 | −0.32 to 0.142 | 14132.00 | 18318.00 | −1.734 |
| Neqb | 0.033 | 0.180 | 0.057 | 0.277 | 0.590 | 0.092 | 0.526 | −0.138 to 0.323 | 15810.00 | 19996.00 | −0.539 |
| Cab | 0.121 | 0.390 | 0.398 | 0.799 | 0.000 | 0.377 | 0.605 | 0.145 to 0.609 | 13151.00 | 17337.00 | −3.534 |
| I | 0.033 | 0.180 | 0.000 | 0.000 | 0.001 | −0.406 | 0.613 | −0.638 to −0.174 | 15488.00 | 77616.00 | −3.414 |
| VAC | 0.154 | 0.392 | 0.048 | 0.215 | 0.001 | −0.406 | 0.613 | −0.638 to −0.174 | 14493.00 | 76621.00 | −3.214 |
| Nd | 4.253 | 7.872 | 2.298 | 3.408 | 0.002 | −0.418 | 0.616 | −0.65 to −0.186 | 12764.50 | 74892.50 | −3.050 |
| Nb | 1.451 | 2.423 | 0.739 | 1.420 | 0.000 | −0.425 | 0.618 | −0.657 to −0.193 | 12369.50 | 74497.50 | −3.789 |
| Dfb | 0.066 | 0.291 | 0.003 | 0.053 | 0.000 | −0.451 | 0.625 | −0.683 to −0.219 | 15181.00 | 77309.00 | −3.831 |
| Neu | 4.505 | 6.339 | 2.395 | 3.209 | 0.001 | −0.521 | 0.644 | −0.754 to −0.288 | 12438.00 | 74566.00 | −3.347 |
| VJ | 10.011 | 8.836 | 6.455 | 5.125 | 0.000 | −0.586 | 0.661 | −0.82 to −0.352 | 11896.00 | 74024.00 | −3.797 |
| VL | 2.912 | 2.946 | 1.710 | 1.741 | 0.001 | −0.588 | 0.661 | −0.821 to −0.354 | 12457.00 | 74585.00 | −3.343 |
| Cba | 0.110 | 0.379 | 0.003 | 0.053 | 0.000 | −0.602 | 0.665 | −0.836 to −0.369 | 14652.50 | 76780.50 | −5.125 |
| Caa | 12.846 | 11.117 | 8.705 | 5.150 | 0.103 | −0.608 | 0.666 | −0.842 to −0.374 | 14244.50 | 76372.50 | −1.631 |
| VB | 0.912 | 1.488 | 0.321 | 0.722 | 0.000 | −0.635 | 0.673 | −0.869 to −0.401 | 12839.50 | 74967.50 | −3.789 |
| VHC | 0.527 | 1.089 | 1.705 | 1.958 | 0.000 | 0.649 | 0.677 | 0.415 to 0.884 | 9100.00 | 13286.00 | −6.641 |
| Da | 0.516 | 1.058 | 0.125 | 0.348 | 0.000 | −0.686 | 0.686 | −0.921 to −0.451 | 12813.00 | 74941.00 | −4.647 |
| Nes | 2.033 | 2.100 | 0.960 | 1.278 | 0.000 | −0.723 | 0.696 | −0.959 to −0.488 | 10783.50 | 72911.50 | −5.088 |
| Dfa | 3.473 | 3.854 | 1.608 | 1.752 | 0.000 | −0.797 | 0.713 | −1.033 to −0.561 | 10952.00 | 73080.00 | −4.769 |
| VH | 22.154 | 15.913 | 13.767 | 8.222 | 0.000 | −0.817 | 0.718 | −1.053 to −0.58 | 11401.00 | 73529.00 | −4.243 |
| Nf | 7.418 | 8.071 | 3.094 | 3.956 | 0.000 | −0.852 | 0.727 | −1.089 to −0.615 | 9110.50 | 71238.50 | −6.401 |
| Nc | 7.780 | 7.781 | 2.688 | 5.303 | 0.000 | −0.864 | 0.729 | −1.101 to −0.627 | 8121.00 | 70249.00 | −7.430 |
| Nep | 3.527 | 4.388 | 1.327 | 1.653 | 0.000 | −0.890 | 0.736 | −1.128 to −0.653 | 12279.50 | 74407.50 | −3.559 |
| T | 1.165 | 1.662 | 0.261 | 0.649 | 0.000 | −0.953 | 0.750 | −1.192 to −0.715 | 10192.00 | 72320.00 | −6.926 |
| Di | 2.901 | 3.169 | 0.912 | 1.538 | 0.000 | −1.003 | 0.761 | −1.243 to −0.763 | 9200.50 | 71328.50 | −6.755 |
| SHI | 4.396 | 4.942 | 1.676 | 1.709 | 0.000 | −1.006 | 0.762 | −1.246 to −0.766 | 10776.00 | 72904.00 | −4.917 |
| VD | 1.703 | 2.355 | 0.375 | 0.861 | 0.000 | −1.012 | 0.763 | −1.252 to −0.772 | 9587.50 | 71715.50 | −7.241 |
| Cbb | 5.890 | 6.402 | 2.415 | 2.001 | 0.000 | −1.022 | 0.765 | −1.263 to −0.782 | 10325.50 | 72453.50 | −5.297 |
| Neqa | 6.011 | 5.997 | 2.537 | 2.227 | 0.000 | −1.034 | 0.768 | −1.274 to −0.794 | 10036.50 | 72164.50 | −5.553 |
| Na | 113.209 | 77.721 | 64.639 | 32.606 | 0.000 | −1.065 | 0.774 | −1.306 to −0.824 | 11326.50 | 73454.50 | −4.308 |
| Ng | 6.473 | 8.048 | 2.068 | 1.980 | 0.000 | −1.090 | 0.780 | −1.331 to −0.848 | 8874.50 | 71002.50 | −6.668 |
| V_2 | 4.560 | 5.879 | 1.128 | 1.214 | 0.000 | −1.197 | 0.801 | −1.44 to −0.953 | 9003.50 | 71131.50 | −6.676 |
| VE | 7.824 | 7.207 | 2.670 | 2.913 | 0.000 | −1.237 | 0.809 | −1.482 to −0.993 | 8111.50 | 70239.50 | −7.335 |
| DE | 27.396 | 21.300 | 11.653 | 7.618 | 0.000 | −1.336 | 0.828 | −1.583 to −1.09 | 8606.50 | 70734.50 | −6.816 |
| P | 22.571 | 16.570 | 9.330 | 6.188 | 0.000 | −1.424 | 0.843 | −1.672 to −1.175 | 8358.00 | 70486.00 | −7.045 |
| Ncd | 6.747 | 5.960 | 1.869 | 1.933 | 0.000 | −1.526 | 0.86 | −1.777 to −1.274 | 7167.50 | 69295.50 | −8.263 |
| VCL | 5.242 | 5.730 | 0.693 | 1.033 | 0.000 | −1.656 | 0.879 | −1.911 to −1.401 | 5248.50 | 67376.50 | −10.615 |
| VC | 48.791 | 35.985 | 14.759 | 10.551 | 0.000 | −1.812 | 0.900 | −2.071 to −1.552 | 5672.50 | 67800.50 | −9.506 |
| D | 44.110 | 31.491 | 14.489 | 8.247 | 0.000 | −1.849 | 0.905 | −2.11 to −1.589 | 5603.00 | 67731.00 | −9.573 |
| VK | 9.736 | 7.688 | 2.293 | 2.086 | 0.000 | −1.889 | 0.909 | −2.151 to −1.627 | 5158.50 | 67286.50 | −10.096 |
| VA | 11.659 | 10.140 | 2.088 | 2.214 | 0.000 | −1.919 | 0.913 | −2.181 to −1.656 | 4616.50 | 66744.50 | −10.605 |
| VF | 4.198 | 3.964 | 0.222 | 0.629 | 0.000 | −2.119 | 0.933 | −2.388 to −1.849 | 4026.50 | 66154.50 | −13.780 |
| Nh | 41.692 | 35.043 | 1.469 | 1.652 | 0.000 | −2.530 | 0.963 | −2.814 to −2.245 | 620.50 | 62748.50 | −14.398 |
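
The columns of Table 1 are linked by a few standard identities, which can be used to sanity-check individual rows. The Python sketch below is illustrative only and is not the authors' analysis code. It rests on three assumptions that this section does not state: (1) Wilcoxon W is the rank sum of one group of size n, so that W − U = n(n + 1)/2; (2) the effect size is a pooled-standard-deviation standardized mean difference taken as RHR minus EHR, without the small-sample Hedges correction; and (3) CLES follows the normal-theory formula Φ(|d|/√2). The group sizes used in the usage lines (91 for EHR, 352 for RHR) are inferred from W − U in the table rather than reported here, and the function names are hypothetical.

```python
import math


def group_size_from_w_and_u(w, u):
    """Recover n from W - U = n(n + 1) / 2 (assumes W is one group's rank sum)."""
    diff = w - u
    # Solve n^2 + n - 2*diff = 0 for the positive integer root.
    n = (math.isqrt(int(8 * diff + 1)) - 1) // 2
    if n * (n + 1) / 2 != diff:
        raise ValueError("W - U is not of the form n(n + 1) / 2")
    return n


def cohens_d(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Standardized mean difference (b minus a) using the pooled SD."""
    pooled_var = ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / (n_a + n_b - 2)
    return (mean_b - mean_a) / math.sqrt(pooled_var)


def cles_from_d(d):
    """Normal-theory common language effect size: Phi(|d| / sqrt(2))."""
    # Phi(x) = 0.5 * (1 + erf(x / sqrt(2))), so Phi(|d| / sqrt(2)) uses erf(|d| / 2).
    return 0.5 * (1.0 + math.erf(abs(d) / 2.0))


# Usage with the "Average sentences per paragraph" row of Table 1.
n_small = group_size_from_w_and_u(4238.00, 52.00)      # inferred EHR group size
n_large = group_size_from_w_and_u(70897.50, 8769.50)   # inferred RHR group size ("Difficult words" row)
d = cohens_d(0.601, 0.345, n_small, 3.316, 1.163, n_large)
print(n_small, n_large, round(d, 3), round(cles_from_d(d), 3))
```

Under these assumptions the recomputed values land close to the tabulated d and CLES for the rows tried. A larger gap on another row would suggest either an extraction error in that row or a different formula (for example, a tie-corrected or Hedges-corrected variant) behind the published figure.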