Representation Enhancement-Based Proximal Policy Optimization for UAV Path Planning and Obstacle Avoidance

<table class="table-group" id="tab2"><tr><td><table class="table">
<tr><td class="thead-hr" colspan="2"><hr/></td></tr>
<tr class="thead"><td class="align_left">Parameter</td><td class="align_center">Value</td></tr>
<tr><td class="thead-hr" colspan="2"><hr/></td></tr>
<tr><td class="align_left"><i>λ</i></td><td class="align_center">0.97</td></tr>
<tr><td class="align_left"><i>ε</i></td><td class="align_center">0.2</td></tr>
<tr><td class="align_left"><span class="nowrap"><i>N</i>-</span>step</td><td class="align_center">3</td></tr>
<tr><td class="align_left">Actor iteration</td><td class="align_center">10</td></tr>
<tr><td class="align_left">Critic iteration</td><td class="align_center">10</td></tr>
<tr><td class="align_left">Actor learning rate</td><td class="align_center"><span class="nowrap">1.0e-4</span></td></tr>
<tr><td class="align_left">Critic learning rate</td><td class="align_center"><span class="nowrap">1.0e-4</span></td></tr>
<tr><td class="align_left">Optimizer</td><td class="align_center">Adam</td></tr>
<tr><td class="align_left">Reward discount factor</td><td class="align_center">0.9</td></tr>
<tr class="table-tr"><td colspan="2"><hr class="tbody-hr"/></td></tr>
</table></td></tr></table>

<div>Optimization parameters of PPO.</div>
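The values in Table 2 can be collected into a small configuration object. The following is a minimal sketch, not the authors' implementation: the field names, the `PPOConfig` class, and the two helper functions are hypothetical, and it assumes that λ is the GAE trace-decay parameter and ε is the PPO clipping range, as is conventional for PPO. The table itself does not define these symbols.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class PPOConfig:
    """Values copied from Table 2; field names are illustrative only."""
    lam: float = 0.97        # lambda (assumed: GAE trace-decay parameter)
    clip_eps: float = 0.2    # epsilon (assumed: PPO clipping range)
    n_step: int = 3          # N-step
    actor_iters: int = 10    # actor iteration
    critic_iters: int = 10   # critic iteration
    actor_lr: float = 1.0e-4
    critic_lr: float = 1.0e-4
    optimizer: str = "Adam"
    gamma: float = 0.9       # reward discount factor


def clipped_surrogate(ratio: float, advantage: float, eps: float) -> float:
    """Standard PPO clipped objective for a single sample."""
    clipped_ratio = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)


def n_step_return(rewards: Sequence[float], bootstrap_value: float,
                  gamma: float) -> float:
    """Discounted sum of the given rewards plus the discounted bootstrap value."""
    ret = bootstrap_value
    for reward in reversed(rewards):
        ret = reward + gamma * ret
    return ret


cfg = PPOConfig()
```

With these settings, a probability ratio of 1.5 on a positive advantage is clipped to 1 + ε = 1.2, and a three-step return over unit rewards with a zero bootstrap value is 1 + 0.9 + 0.81 = 2.71.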

International Journal of Aerospace Engineering


Table 2
