Wireless Communications and Mobile Computing

Research Article

Dynamic Routing Strategy for Directed Transmissions of High-Valued Contents in NGSO Satellite-Based Internet

-learning in multicast routing.

1: Q(S,A) Connect Matrix
2: Destination Node
3: Source Node
4: fordo
5: Begin State
6: End State
7: for1,2,…end do
8: if is empty then
9: break
10: end if
11: for1,2,…end do
12: if Random then
13: A(1) Max(Q(S(), A()))
14: else
15: A(1) Random(Q(S(), A()))
16: end if
17: S S()
18: Q Q(S, A))
19: if S is empty then
20: Q Reward Constant
21: else
22: if S is equal to End State then
23: Q Reward Constant
24: else
25: Reward
26: Q Reward Max(Q(S, all A))
27: S(+1) S
28: end if
29: end if
30: Q(S, A) Q(S, A)
31: if converge then
32: Counter Counter
33: if Counter Counter_Threshold then
34: break
35: end if
36: else
37: Counter clear
38: end if
39: end for
40:
41:
42: end for
43:end for