Wireless Communications and Mobile Computing

Research Article

A Deep Learning-Based Power Control and Consensus Performance of Spectrum Sharing in the CR Network

Power control policy-deep learning training based.

Initialize D with capacity O
Initialize network with variable weights
Initialize and and then obtain (1)
For , do
Update via power control policy of PU (2) or (3)
With iterations , choose an arbitrary action ; otherwise, select
Obtain via arbitrary model (5) and detect reward
Store transition in
Sensing delay
Repeat sensing delay
If then
Sample random minibatch of iterations from ,
Here, the index is uniformly selected at independent
Minimize loss function of 12, in which goal is given by (15)
Adjust
End if
is the target state and then initialize and and then gain
end if
end for