Research Article

Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on Spark

Table 1

The features’ description of the database.

#FeatureAbbreviationDescription

1radius_meanra_meanMean of distances from the center to points on the perimeter
2texture_meante_meanStandard deviation of grayscale values
3smoothness_meansm_meanMean of local variation in radius lengths
4compactness_meancom_meanMean of local variation in radius lengths
5concavity_meancon_meanMean of severity of concave portions of the contour
6symmetry_meansy_mean
7fractal_dimension_meanfr_di_meanMean for “coastline approximation”-1
8radius_sera_seStandard error for the mean of distances from the center to points on the perimeter
9texture_sete_seStandard error for the standard deviation of grayscale values
10smoothness_sesm_seStandard error for local variation in radius lengths
11compactness_secom_seStandard error for perimeter^2/area-1.0
12concavity_secon_seStandard error for severity of concave portions of the contour
13concave_points_secon_po_seStandard error for the number of concave portions of the contour
14symmetry_sesy_se
15fractal_dimension_sefr_di_seStandard error for “coastline approximation”-1
16smoothness_worstsm_worst“Worst” or largest mean value for local variation in radius lengths
17compactness_worstcom_worst“Worst” or largest mean value for perimeter^2/area-1.0
18concavity_worstcon_worst“Worst” or largest mean value for severity of concave portions of the contour
19symmetry_worstsym_worst
20fractal_dimension_worstfra_dim_worst“Worst” or largest mean value for “coastline approximation”-1