Research Article
A New Random Forest Algorithm Based on Learning Automata
Table 2
Details of textual data used for evaluation.
| | Domain | Name | # Feature | # Instance |
| | Text | Stanford—Sentiment 140 corpus [106] | Bag of word | 1600000 | | Large dataset of movie reviews [107] | Bag of word | 50000 | | Sentence polarity dataset v1.0 [108] | Bag of word | 10662 | | Internet movie database [105] | Bag of word | 1400 | | Yelp review [105] | Bag of word | 598000 | | Amazon review [105] | Bag of word | 1000000 | | Healthcare | Heart disease dataset [105] | 13 | 200 | | Breast cancer dataset [105] | 30 | 569 | | Arrhythmia dataset [105] | 279 | 454 | | Parkinson dataset [105] | 45 | 241 | | Caesarean section dataset [105] | 5 | 81 | | Gene expression dataset [105] | 255 | 801 | | Diabetes dataset [105] | 7 | 765 | | Statlog (heart) dataset [105] | 13 | 271 | | Physical | Ionosphere dataset [105] | 34 | 352 | | Sonar, mines vs. rocks dataset [105] | 60 | 208 | | Sound | Voice dataset [105] | 20 | 3168 | | Emotions from music dataset [105] | 28 | 592 |
|
|