Research Article

An Empirical Study on Downstream Dependency Package Groups in Software Packaging Ecosystems

Table 11

Prediction results of number of releases of CDDGs.

Prediction modelsCDDG size  100CDDG size  100
CargoCPANRubyGemsCargoCPANRubyGems

Linear regressionMAE3.571.526.563.726.8711.02
RMSE5.082.467.895.068.6317.06
MAPE18.07%11.26%23.82%26.95%36.85%36.50%

Random forestMAE2.511.435.743.166.579.70
RMSE2.902.277.364.458.4514.07
MAPE16.10%9.59%21.75%20.30%30.46%34.31%

KNNMAE2.541.417.243.748.029.72
RMSE2.822.138.555.109.8413.57
MAPE15.70%9.12%26.20%24.15%36.00%33.03%

AdaBoostMAE3.231.797.183.807.9212.94
RMSE3.922.589.304.979.1215.49
MAPE21.12%12.14%24.56%22.14%37.72%37.71%

GBRTMAE2.701.526.183.596.429.62
RMSE3.372.437.705.009.0314.21
MAPE18.70%10.43%22.92%22.58%28.49%36.76%

Highlight the optimal results under each indicator of different models.