Research Article

Relevant-Based Feature Ranking (RBFR) Method for Text Classification Based on Machine Learning Algorithm

Table 1

Comparison of recent works related to imbalanced classification.

ReferenceTechniqueMethodologyComments

[23]Extreme gradient boostingTime-, frequency-, and spatial-based features were extracted by the proposed algorithm. Random forest is used for classification.Correlation in time-based features can be improved. Embedded FS can be incorporated.
[24]Orthogonal least squaresThe authors have improved the speed of fetching the best features using orthogonal least squares. They have compared mutual information and other embedded methods.Multiple correlation coefficient and the canonical correlation coefficient can be improved when feature generation and instance generation methods are used.
[25]Centroid mutation-based searchA set of features which can represent a strong convergence to a set of classes is identified. This increases the position of classification margin and reduces the error.The noisy features can be identified and removed before finding the strong convergence.
[26]Balanced pointwise mutual informationA deep learning model is employed in Twitter text classification. Special characters like emoji are used as features to classify tweets.Spam detection can be implemented to increase the accuracy.
[27]Term weightingMost of the feature selection methods just use frequency. The authors used category information as additional metric to select features for classification.Semantics information can degrade the performance of the classification.