Research Article
Edge Caching for D2D Enabled Hierarchical Wireless Networks with Deep Reinforcement Learning
Algorithm 1
Q-Learning-based content caching algorithm.
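| Initialization: Q-Table $Q(s, a)$ |
| Iteration: |
| 1: for each episode |
| 2: Initialize state $s$ |
| 3: for each step of episode |
| 4: Generate a number $x \in [0, 1]$ at random |
| 5: if $x < \varepsilon$ |
| 6: randomly select an action $a$ |
| 7: else |
| 8: choose $a$ using the policy derived from $Q$ |
| 9: Take action $a$ |
| 10: Obtain reward $r$ and next state $s'$ |
| 11: Update Q-Table: $Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$ |
| 12: $s \leftarrow s'$ |
| 13: end for |
| 14: end for |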
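The loop in Algorithm 1 has the shape of standard tabular Q-learning with an exploration branch, so a minimal Python sketch may help make the steps concrete. The environment here is a toy assumption and not the system model of the article: the cache holds one content item, the state is the currently cached item, the action is the item to cache next, and the reward is 1 when the next request hits the cache. The names `q_learning_cache`, `greedy_policy`, and `popularity`, and the default values of `alpha`, `gamma`, and `epsilon`, are hypothetical.

```python
import random


def q_learning_cache(popularity, episodes=200, steps=50,
                     alpha=0.1, gamma=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning on a toy single-slot cache (illustrative assumption).

    popularity: request weights per content item (hypothetical input).
    Returns the learned Q-table as a list of rows, q[state][action].
    """
    rng = random.Random(seed)
    n = len(popularity)
    contents = list(range(n))

    # Initialization: Q-table with one row per state, one column per action.
    q = [[0.0] * n for _ in range(n)]

    for _ in range(episodes):
        state = rng.randrange(n)  # initialize the state
        for _ in range(steps):
            # Exploration branch: random action with probability epsilon.
            if rng.random() < epsilon:
                action = rng.randrange(n)
            else:
                # Greedy action with respect to the current Q-table.
                action = max(contents, key=lambda a: q[state][a])

            # Take the action: cache item `action`, then observe one request.
            request = rng.choices(contents, weights=popularity)[0]
            reward = 1.0 if request == action else 0.0
            next_state = action  # toy transition: the cached item is the state

            # Standard Q-learning update toward the bootstrapped target.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])

            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning_cache([0.7, 0.2, 0.1]))` returns the preferred item to cache for each state under these assumptions. A realistic setting would need a much larger state space (cache contents across several nodes, channel conditions), which is where a table stops scaling and function approximation becomes attractive.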