Research Article

An Empirical Study on Downstream Dependency Package Groups in Software Packaging Ecosystems

Table 10

Prediction results of CDDG size.

Prediction modelsCDDG size  100CDDG size  100
CargoCPANRubyGemsCargoCPANRubyGems

Linear regressionMAE35.1435.53101.7412.1614.4313.39
RMSE45.7761.84117.7514.7122.4424.10
MAPE13.08%18.78%72.77%32.76%37.66%25.99%

Random forestMAE25.1137.0761.234.531.4111.79
RMSE33.0565.4187.476.355.4115.80
MAPE9.11%17.18%36.86%10.65%2.4%25.18%

KNNMAE75.6944.8167.6113.9712.6513.08
RMSE95.6174.1193.8018.5515.7716.90
MAPE32.09%20.10%37.98%41.29%30.26%30.96%

AdaBoostMAE75.4840.1499.425.593.3214.56
RMSE116.6268.91150.937.025.8818.03
MAPE19.53%19.80%39.29%13.47%7.2%30.65%

GBRTMAE23.2736.2390.334.511.2012.30
RMSE28.4061.69157.876.363.5816.86
MAPE9.20%17.62%35.66%10.59%2.2%26.48%

Highlight the optimal results under each indicator of different models.