Review Article

Predicting Student Academic Performance at Higher Education Using Data Mining: A Systematic Review

Table 2

Summary of the related studies on predicting student’s level in the course.

RefTechniquesResultsStudy sample sizeFindings

[29]RF & ANNAccuracy (74%)1854The results showed that the RF and ANN outperformed SVM, LR, NB, and KNN.
[31]RFAccuracy (79%–91%)32,593The feature engineering technique improved the prediction model where it achieved more than 80% accuracy.
[32]Ensemble (bagging)Accuracy (66.7%) for dataset_1Dataset_1 : 52The highest results were achieved when considering 50% of the coursework.
(93.1%) for dataset_2Dataset_2 : 486
[33]LRAccuracy (78.75%)499Several factors including data cleaning, the type of features, and the dataset size influenced the final model’s accuracy.
[34]RFAccuracy (59.64%)289The formative assessment tasks grades were able to predict at-risk students.
[35]Hybrid (DT and K-means)Accuracy (75.47%)Student’s behavior and students’ academic success are very strongly related.
[36]RFAccuracy (96.4%)145Regression methods outperformed the classification in predicting course GPA.
[37]BP-NN101NCEE score and learning attitude have a major influence on the final exam results.
[38]ANN (MLP)Accuracy (76.07%)163
[30]Ensemble (bagging)Accuracy (99.40%)480Ensemble models outperformed other classification algorithms.
[39]ANNAccuracy (78.1%)480Student’s attendance and parents’ participation have a great impact on student’s performance level.
[40]SVMAccuracy (76.3%)38Preadmission information influences the student grades
[41]Ensemble method with DTAccuracy (95.78%)1044A balanced dataset and the selected features increase the model’s accuracy.
[42]RFF1-score (74%)240The accuracy of the prediction model has improved when considering the 2nd test.
[43]RFMAE (1.198 to 1.91)592RF provides the best result followed by SMOreg and bagging.
[44]SVMF1-score (92%) for distance datasetOnline dataset: 262The accuracy of the prediction model has improved when the student performed 50% of distance education course and 25% of on-campus education course.
F1-score (83%) for on-campus datasetOn-campus dataset: 161
[45]RBMRMSE (0.3)225CGPA, HSSC, and an entry test can predict student performance.
[46]NB700Demographic information and school grades allow predicting a college student’s level.