Applied Computational Intelligence and Soft Computing

Research Article

Performance Augmentation of Base Classifiers Using Adaptive Boosting Framework for Medical Datasets

Table 2

Results for precision, F-measure, and recall error of all the classifiers in AdaBoost framework for various data sets.


Classifiers	Data sets	Leukaemia	Lymphoma-I	Lymphoma-II	GCM	Data set C
	Instances	38	45	96	144	60
	Attributes	7130	4027	4027	16064	7130

Naïve Bayes	Recall	0.947	0.911	0.75	0.167	0.6
	F-measure	0.946	0.911	0.692	0.048	0.56
	Precision	0.951	0.914	0.683	0.028	0.552

Voted perceptron	Recall	0.789	0.978	xx	0.167	0.6
	F-measure	0.799	0.978	xx	0.048	0.58
	Precision	0.847	0.979	xx	0.028	0.583

Stacking	Recall	0.711	0.511	0.479	0.167	0.65
	F-measure	0.59	0.511	0.31	0.048	0.512
	Precision	0.505	0.511	0.23	0.028	0.423

Adaboost	Recall	0.895	0.867	0.51	0.167	0.583
	F-measure	0.895	0.867	0.445	0.094	0.52
	Precision	0.895	0.87	0.403	0.066	0.5

Bagging	Recall	0.921	0.933	0.865	xx	0.633
	F-measure	0.92	0.933	0.84	xx	0.629
	Precision	0.92	0.934	0.836	xx	0.626

J48	Recall	0.842	0.822	0.835	xx	0.567
	F-measure	0.842	0.822	0.835	xx	0.546
	Precision	0.842	0.825	0.864	xx	0.536

Random tree	Recall	0.789	0.644	0.667	0.382	0.683
	F-measure	0.789	0.643	0.649	0.373	0.685
	Precision	0.789	0.645	0.648	0.37	0.687

Random forest	Recall	0.791	0.778	0.781	0.521	0.65
	F-measure	0.761	0.776	0.743	0.509	0.643
	Precision	0.791	0.788	0.75	0.53	0.639

Bayes network	Recall	0.933	0.978	xx	0.167	0.633
	F-measure	0.933	0.978	xx	0.048	0.629
	Precision	0.934	0.979	xx	0.028	0.626

Decision stump	Recall	0.895	0.867	0.51	0.167	0.633
	F-measure	0.895	0.867	0.445	0.094	0.629
	Precision	0.895	0.87	0.403	0.066	0.626

Zero-R	Recall	0.711	0.444	0.479	0.167	0.65
	F-measure	0.59	0.429	0.31	0.048	0.512
	Precision	0.505	0.434	0.23	0.028	0.423

Input mapped classifier	Recall	0.711	0.444	0.479	0.167	0.567
	F-measure	0.59	0.429	0.31	0.048	0.571
	Precision	0.505	0.434	0.23	0.028	0.576

The crossed (xx) cells show that the results could not be generated for the specific classifier because of the limitation of the framework or the data set. Hence, the evaluation of these classifiers’ results has been carried out manually to check if any better results could be gathered for comparison. The results in bold indicate the best results for different datasets.