Research Article

Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning

Table 1

Symbols and reward terms.

Symbols
Running cycle
Set of actions at the t-th cycle
Position set of the agent at the t-th cycle
Agent’s central coordinates at the t-th cycle
Reward
Position set of the obstacle
Position set of the track
Position set of the end
Constant
Empty set

Reward terms
Distance reward
Obstacle collision penalty
Off-track penalty
Time penalty
Obstacle avoidance reward
Speed reward
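
The position sets and the empty set listed in Table 1 suggest that collision and off-track events can be detected with set operations. The following is a minimal sketch under that assumption, not the authors' implementation: the grid-cell representation, the function names (`collides`, `off_track`, `event_penalty`), and the parameters `c` and `time_cost` are all hypothetical, and the distance, obstacle avoidance, and speed rewards are omitted because their formulas are not given here.

```python
from typing import Set, Tuple

# Assumption: positions are modelled as sets of integer grid cells.
Cell = Tuple[int, int]


def collides(agent: Set[Cell], obstacle: Set[Cell]) -> bool:
    """True when the agent and obstacle position sets intersect (intersection is not empty)."""
    return len(agent & obstacle) > 0


def off_track(agent: Set[Cell], track: Set[Cell]) -> bool:
    """True when any part of the agent lies outside the track position set."""
    return not agent <= track


def event_penalty(agent: Set[Cell], obstacle: Set[Cell], track: Set[Cell],
                  c: float = 1.0, time_cost: float = 0.5) -> float:
    """Sum of the time, collision, and off-track penalties for one cycle.

    `c` and `time_cost` are hypothetical magnitudes chosen for illustration.
    """
    reward = -time_cost          # time penalty, applied every cycle
    if collides(agent, obstacle):
        reward -= c              # obstacle collision penalty
    if off_track(agent, track):
        reward -= c              # off-track penalty
    return reward
```

For example, an agent occupying cells {(0, 0), (0, 1)} on a track that contains both cells, with an obstacle at (0, 1), would incur the time penalty plus the collision penalty in this sketch.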