Research Article
Alignment Method of Combined Perception for Peg-in-Hole Assembly with Deep Reinforcement Learning
| 1: Initialize replay buff | | 2: Initialize evaluation network parameters | | 3: Initialize target network parameters | | 4: for episode=1, do | | 5: for, do | | 6: Obtain image from environment | | 7: With probability select a random adjustment action | | 8: otherwise select adjustment action | | 9: Execute adjustment action in CoppeliaSim | | 10: Obtain image and reward from environment | | 11: Store transition in | | 12: Sample random minibatch of transitions from | | 13: Set | | 14: Perform a gradient descent step on | | 15: end for | | 16: end for |
|