Research Article
End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
Table 4
The secondary training results for TC-network.
| | Method | Environment | TC-network (%) |
| | Pretraining | Parameter selection | 92.36 | | Maze-1 | 84.52 | | Maze-2 | 85.14 | | Maze-3 | 78.32 | | Targeted training | Maze-1 | 93.16 | | Maze-2 | 92.67 | | Maze-3 | 92.03 | | Generalization training | Maze-1/Maze-2 | 90.89 | | Maze-1/Maze-3 | 91.35 | | Maze-2/Maze-3 | 90.62 | | Maze-1/Maze-2/Maze-3 | 90.28 |
|
|