Research Article
Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on Spark
Table 5
Cross-validation result of ML models that are applied to features selected by univariate.
| Models | Accuracy of cross-validation (%) | Accuracy of unseen data (%) | Best value of parameters |
| LR | 98.6 | 98.4 | regPram: 0.1 maxIter: 30 | DT | 97.80 | 90.35 | impuity: gini maxDepth: 5 maxBins: 32 | SVM | 98.2 | 98.07 | regParam: 0.02 maxIter: 50 Kernal type: Liner | RF | 99.1 | 93.85 | maxDepth: 6 maxBins: 32 numTrees: 20 |
|
|