Research Article

A Deep Learning-Based Power Control and Consensus Performance of Spectrum Sharing in the CR Network

Algorithm 1

Power control policy-deep learning training based.
  Initialize D with capacity O
  Initialize network with variable weights
  Initialize and and then obtain (1)
  For , do
  Update via power control policy of PU (2) or (3)
 With iterations , choose an arbitrary action ; otherwise, select
  Obtain via arbitrary model (5) and detect reward
  Store transition in
  Sensing delay
  Repeat sensing delay
  If then
  Sample random minibatch of iterations from ,
  Here, the index is uniformly selected at independent
  Minimize loss function of 12, in which goal is given by (15)
  Adjust
  End if
   is the target state and then initialize and and then gain
 end if
  end for