Research Article
A Deep Learning-Based Power Control and Consensus Performance of Spectrum Sharing in the CR Network
Algorithm 1
Power control policy-deep learning training based.
| Initialize D with capacity O | | Initialize network with variable weights | | Initialize and and then obtain (1) | | For , do | | Update via power control policy of PU (2) or (3) | | With iterations , choose an arbitrary action ; otherwise, select | | Obtain via arbitrary model (5) and detect reward | | Store transition in | | Sensing delay | | Repeat sensing delay | | If then | | Sample random minibatch of iterations from , | | Here, the index is uniformly selected at independent | | Minimize loss function of 12, in which goal is given by (15) | | Adjust | | End if | | is the target state and then initialize and and then gain | | end if | | end for |
|