Research Article
Deep Grid Scheduler for 5G NB-IoT Uplink Transmission
Algorithm 1
Procedure of double deep Q network.
Initialize the evaluation network and target network
for episode in episodes do
    initialize and choose state from NB-IoT MAC
    while episode has not ended do
        sample and get action
        observe NB-IoT MAC and get reward and next state
        store experience
        update
        if batch size ≥ memory capacity then
            update
        end if
    end while
end for
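The core pieces of Algorithm 1 can be illustrated with a minimal Python sketch. It is not the authors' implementation: plain lists of Q-values stand in for the outputs of the evaluation and target networks, and the names `ReplayMemory`, `epsilon_greedy`, and `double_dqn_target`, the epsilon-greedy sampling rule, and the discount factor `gamma` are all assumptions made for illustration. The sketch shows the step that distinguishes a double deep Q network: the evaluation network selects the next action, while the target network supplies the value of that action.

```python
import random
from collections import deque


class ReplayMemory:
    """Fixed-capacity experience store (hypothetical helper)."""

    def __init__(self, capacity):
        # The oldest experiences are dropped once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def store(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # Uniform sampling without replacement.
        return rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def double_dqn_target(reward, next_q_eval, next_q_target, gamma=0.9, done=False):
    """Double DQN target for one transition.

    next_q_eval:   Q-values of the next state from the evaluation network.
    next_q_target: Q-values of the next state from the target network.
    gamma:         discount factor (assumed value, not from the article).
    """
    if done:
        return reward
    # The evaluation network chooses the action...
    best_action = max(range(len(next_q_eval)), key=lambda a: next_q_eval[a])
    # ...and the target network evaluates it.
    return reward + gamma * next_q_target[best_action]
```

In a full training loop, the target returned by `double_dqn_target` would serve as the regression label for the evaluation network on each sampled transition, and the target network's weights would be refreshed from the evaluation network at intervals.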