Research Article
PPDRL: A Pretraining-and-Policy-Based Deep Reinforcement Learning Approach for QoS-Aware Service Composition
| | Require:: the given composite service, : the service class for | | , : training steps, : batch size. | | Ensure: the optimal QoS value for | | Initialize neural network params ; | | Generate initial samples; | | while convergence condition is not satisfied do | | Update samples with better results if not the first cycle; | | Pretrain the neural network based on MLE; | | for to do | | Given and , get the candidate services score distribution ; | | ; | | | | ; | | end for | | end while |
|