Research Article
Deep Reinforcement Learning for Scheduling and Offloading in UAV-Assisted Mobile Edge Networks
Algorithm 1
traverse relaxation vectors
and
to find optimal
and
.Input: Parameters , and | Output: , | 1: Elements in the vectors and are rearranged in ascending order to get and | 2: Initialize as a very large number and and as empty vectors; | 3: fordo | 4: fordo | 5: fordo | 6: Compute Q() from (8); | 7: ifQ() < then | 8: ; | 9: [k] = ; | 10: ; | 11: end if | 12: end for | 13: end for | 14: end for | 15: return,. |
|