Research Article
Dynamic Routing Strategy for Directed Transmissions of High-Valued Contents in NGSO Satellite-Based Internet
Algorithm 2
-learning in multicast routing.
1: Q(S,A) Connect Matrix | 2: Destination Node | 3: Source Node | 4: fordo | 5: Begin State | 6: End State | 7: for1,2,…end do | 8: if is empty then | 9: break | 10: end if | 11: for1,2,…end do | 12: if Random then | 13: A(1) Max(Q(S(), A())) | 14: else | 15: A(1) Random(Q(S(), A())) | 16: end if | 17: S S() | 18: Q Q(S, A)) | 19: if S is empty then | 20: Q Reward Constant | 21: else | 22: if S is equal to End State then | 23: Q Reward Constant | 24: else | 25: Reward | 26: Q Reward Max(Q(S, all A)) | 27: S(+1) S | 28: end if | 29: end if | 30: Q(S, A) Q(S, A) | 31: if converge then | 32: Counter Counter | 33: if Counter Counter_Threshold then | 34: break | 35: end if | 36: else | 37: Counter clear | 38: end if | 39: end for | 40: | 41: | 42: end for | 43:end for |
|