Research Article
Detecting Web Spam Based on Novel Features from Web Page Source Code
Table 4
Results of using three different feature sets with the random forest model.
| Feature set | Accuracy | Precision | Recall | F1 score | AUC |
| --- | --- | --- | --- | --- | --- |
| Selected existing features | 0.911 | 0.909 | 0.911 | 0.909 | 0.937 |
| Novel features | 0.911 | 0.910 | 0.911 | 0.907 | 0.917 |
| Selected existing + novel features | 0.930 | 0.929 | 0.930 | 0.829 | 0.957 |