Research Article

Event Driven Duty Cycling with Reinforcement Learning and Monte Carlo Technique for Wireless Network

Table 1

The notations used in the paper.

NotationDescription

qiThe capacity of the queue for nodes as node-i (i= 1, ..., N)
S, A, P, RComponents of MDP: state space, action space, transition probability, the reward function
αLearning rate
γDiscount factor
(i)Value of node-i
G = (V, E)WSN with the set of nodes, V, and edges, E
rTransmission range of a node
NB(i)Neighbor nodes of node-i
(i)Duration of slots when node-i works
wk(i)Slot when node-i is wake-up
p(i)The parent node of node-i
c(i)The child node of node-i
sch(i)Transmission schedule of node-i
F(i)Nodes of NB(i) forbidden to wake up
pc(i)Candidate parent nodes of node-i
τ = (ns, …, nd)The path from the source to the destination node