Research Article

PPDRL: A Pretraining-and-Policy-Based Deep Reinforcement Learning Approach for QoS-Aware Service Composition

Table 3

The QoS values (mean and variance) on six different test cases.

Problem #PPDRLMCOP_MGAPTRQLRDQN
MeanVarMeanVarMeanVarMeanVarMeanVarMeanVar

Nodes 100.320.000.242.22e − 30.320.00000.296.83e − 50.291.19e − 40.252.45e − 4
Nodes 300.310.000.134.23e − 30.284.38e − 30.131.55e − 50.125.54e − 50.051.44e − 4
Nodes 500.200.00000.081.37e − 30.201.07e − 70.137.12e − 60.134.83e − 60.063.32e − 4
Nodes 700.156.64e-120.062.89e − 40.141.02e − 60.071.49e − 50.072.22e − 50.021.72e − 4
Nodes 900.140.00000.054.30e − 50.134.56e − 60.046.50e − 50.037.36e − 5-0.023.48e − 5
Nodes 1000.207.54e-70.094.30e − 40.182.31e − 60.092.57e − 50.082.31e − 40.055.18e − 4