Research Article
Optimal Channel Selection Based on Online Decision and Offline Learning in Multichannel Wireless Sensor Networks
Algorithm 2
Channel selection based on
-learning.
| // Initialization | | (1) Each sensor initializes its action space | | (2) Each sensor initializes its -value | | //Learning | | (3) Sensor takes random action | | (4) Sensor observes reward and state | | (5)while () do | | (6) for to | | (7) for to | | (8)if is good | | (9) | | (10) else | | (11) | | (12)end if | | (13) Sensor updates its -value according to (17) | | (13) | | (14) | | (15) end for | | (16) end for | | (17)end while | | (18) Choose from using policy derived from |
|