Research Article

Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on Spark

Table 4

Results of models were applied on the selected features by RFECV.

ModelAccuracy of cross-validation (%)Accuracy of unseen data (%)Best value of parameters

LR99.598.8regPram: 0.1
maxIter: 20
DT98.691.2imprity: gini
maxDepth: 5
maxBins: 32
SVM98.998.5regParam: 0.02
maxIter: 50
Kernal type: Liner
RF10099.1maxDepth: 7
maxBins: 32
numTrees: 20