Journal of Sensors

Research Article

Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning

Symbols and rewards terms.


Symbols
	Running cycle
	Set of actions at th cycle
	Position set of agent at th cycle
	Agent’s central coordinates at th cycle
	Reward
	Position set of the obstacle
	Position set of the track
	Position set of the end
	Constant
	Empty set

Rewards terms
Distance reward
Obstacle collision penalty
Off the track penalty
Time penalty
Obstacle avoidance reward
Speed reward