Research Article

Optimal Path Planning for Wireless Power Transfer Robot Using Area Division Deep Reinforcement Learning

Table 1

Symbols and explanations.

SymbolExplanation

The number of energy harvesters
Rectifier efficiency
Gain of transmitter’s antenna
Gain of receiver’s antenna
Wavelength of transmitted signal
Polarization loss
Friis’s free space adjustable parameter
Distance between transmitter and harvester
Transmit power
Received power
Angle between transmitter and the vertical reference line
Maximum effective transmit area
Effective received area
Time instant
Position and units to left and upmost edges
Position of th energy harvester
Effective charging area for th IoT devices
Present system state
Next system state
Action taken at
Total time consumption
Transition probability from state to state taking action
Reward function at state taking action
Indicator whether harvesters have been charged
Unit price for reward function
Optimal strategy
Cost function at state taking action
Reward decay
Learning rate for Q-learning
Selected location for th area
th area