GMOM: An Offloading Method of Dependent Tasks Based on Deep Reinforcement Learning

<table class="table-group" id="tab2"><tr><td><table class="table"><tr><td class="thead-hr" colspan="1"><hr/></td></tr><tr class="thead"><td class="align_left">Algorithm GMOM</td></tr><tr><td class="thead-hr" colspan="1"><hr/></td></tr><tr><td class="align_left">(1) Initialize the initial network parameter <svg height="9.49473pt" id="M147" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -9.28833 6.59789 9.49473" width="6.59789pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g></svg> that the actor network and the critic network share randomly</td></tr><tr><td class="align_left">(2) Initialize the parameter <svg height="12.584pt" id="M148" style="vertical-align:-3.29567pt" version="1.1" viewbox="-0.0498162 -9.28833 20.2581 12.584" width="20.2581pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,6.149,3.132)"><path d="M262 451C169 451 38 375 38 210C38 98 121 -12 261 -12C360 -12 481 64 481 227C481 351 386 451 262 451ZM249 414C337 414 385 320 385 204C385 70 333 25 272 25C189 25 134 117 134 241C134 355 189 414 249 414Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,10.808,3.132)"><path d="M244 0V30C178 35 170 41 170 107V710C136 697 72 680 18 675V647C83 642 89 639 89 572V107C89 43 80 35 16 30V0H244Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,14.697,3.132)"><path d="M526 56L493 58C457 61 448 68 448 119V710C411 697 344 682 288 675V646C362 641 367 639 367 573V439C344 448 315 451 300 451C164 451 40 342 40 202C40 61 147 -12 227 -12C239 -12 266 -7 305 16L367 53V-12C427 10 505 21 526 26V56ZM367 88C342 69 306 54 271 54C207 54 133 111 133 229C133 373 219 409 264 409C301 409 342 394 367 359V88Z"></path></g></svg> of the old actor network with <svg height="9.49473pt" id="M149" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -9.28833 6.59789 9.49473" width="6.59789pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g></svg></td></tr><tr><td class="align_left">(3) For iteration = 1, 2, … do</td></tr><tr><td class="align_left">(4)  for <i>t</i> = 1, 2, …, <i>N</i> do</td></tr><tr><td class="align_left">(5)   for <i>i</i> = 1, 2, …, <i>D</i> do</td></tr><tr><td class="align_left">(6)    the whole episode is collected with the old actor network, and the obtained data is stored in the experience pool <i>D</i></td></tr><tr><td class="align_left">(7)    calculate the GAE function value for each time step according to formula (<a href="https://static-preview.hindawi.com/articles/misy/volume-2022/9587040/figures/#EEq14" target="_blank">14</a>), get <svg height="17.0133pt" id="M150" style="vertical-align:-3.5977pt" version="1.1" viewbox="-0.0498162 -13.4156 46.7549 17.0133" width="46.7549pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M686 28C612 35 607 44 591 112C563 234 541 360 519 489L489 666L457 658L147 121C100 40 89 36 24 28L17 0H240L250 28C168 34 159 41 190 101L262 237H482C495 180 503 137 510 91C517 47 514 35 441 28L433 0H677L686 28ZM475 280H285L429 541H431L475 280Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,9.135,-6.899)"><path d="M700 305H441V272C542 264 548 258 548 186V107C548 64 536 53 516 42C493 30 462 24 429 24C237 24 151 189 151 333C151 517 264 626 415 626C507 626 585 594 609 470L642 476C634 544 629 601 626 636C594 644 513 665 429 665C234 665 46 554 46 321C46 123 194 -15 415 -15C495 -15 578 7 642 22C635 50 635 81 635 116V201C635 259 639 264 700 272V305Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,15.732,-6.899)"><path d="M682 0V33C615 38 603 49 571 134C507 305 441 494 378 665L343 655L136 136C101 48 86 41 22 33V0H246V33C170 41 161 50 183 112C196 152 209 192 224 232H441C461 176 478 126 493 87C504 49 499 41 433 33V0H682ZM425 279H242C273 363 303 448 335 533H337L425 279Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,22.13,-6.899)"><path d="M522 166C507 123 488 90 472 71C449 44 419 38 347 38C296 38 261 38 242 51C223 63 217 85 217 132V315H311C398 315 406 310 419 239H453V433H419C406 365 401 358 311 358H217V580C217 609 219 612 251 612H331C400 612 427 604 440 584C453 561 463 541 472 498L506 502C501 558 497 627 497 650H43V618C122 610 130 609 130 520V128C130 46 121 40 30 32V0H521C529 30 552 128 557 162L522 166Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,27.408,-6.899)"><path d="M309 -142C211 -61 151 99 151 271S210 601 309 683L288 710C141 611 75 451 75 272V271C75 90 141 -72 288 -169L309 -142Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,30.665,-6.899)"><path d="M493 373C493 420 472 451 445 451C421 451 400 435 400 410C400 404 402 400 406 394S417 371 417 350C417 259 317 128 256 55H254C260 126 256 262 238 340C219 424 194 451 163 451C128 451 78 413 24 329L52 302C89 353 106 370 121 370C129 370 137 363 147 336C186 231 196 73 184 -16C152 -88 120 -192 112 -238L130 -257C157 -253 213 -227 231 -213C231 -187 234 -77 240 -23C251 -6 277 30 309 69C424 208 493 299 493 373Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,35.372,-6.899)"><path d="M98 134C72 134 46 117 46 90C46 73 55 65 60 64C95 55 124 32 124 -4C124 -42 95 -68 44 -89L57 -123C122 -104 194 -60 194 22C194 94 136 134 98 134Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,37.529,-6.899)"><path d="M547 100L523 123C489 76 461 62 452 62C442 62 433 71 427 109C405 243 387 408 378 495C360 666 322 710 263 710C230 710 182 688 161 667L170 639C184 646 203 651 218 651C247 651 272 634 288 562C297 521 300 485 304 436C230 267 111 104 24 11L33 -12L116 6C163 73 263 255 311 362L347 84C355 26 373 -12 406 -12C440 -12 489 12 547 100Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,42.724,-6.899)"><path d="M283 271C283 451 220 610 70 710L48 683C146 603 207 443 207 271S149 -59 48 -142L70 -169C220 -67 283 90 283 271Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,10.244,3.438)"><path d="M329 433H203L239 587L230 596L147 534L123 433H57L30 395L34 388H115L61 129C37 16 59 -12 85 -12C147 -12 222 58 260 98L241 125C212 95 160 62 144 62C132 62 127 71 138 126L192 386L305 394L329 433Z"></path></g></svg> and cache it</td></tr><tr><td class="align_left">(8)    calculate the value in each state according to formula (<a href="https://static-preview.hindawi.com/articles/misy/volume-2022/9587040/figures/#EEq17" target="_blank">17</a>) and get <svg height="16.3374pt" id="M151" style="vertical-align:-3.291101pt" version="1.1" viewbox="-0.0498162 -13.0463 37.5185 16.3374" width="37.5185pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M697 650H468L461 623L492 619C539 613 547 605 518 546C481 471 367 264 278 116H276C239 278 197 500 186 567C180 604 185 613 226 619L252 623L260 650H24L17 623C78 617 92 613 108 533L216 -11H247C365 200 515 462 560 529C616 612 624 615 689 623L697 650Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,9.224,-5.741)"><path d="M583 449L551 460C535 436 529 433 494 433C409 433 311 438 228 438C97 438 53 381 24 341L44 316C85 355 124 372 183 372C161 259 87 53 26 5L34 -12C53 -12 94 -4 118 12C166 92 211 256 236 371L394 369C375 301 352 212 334 121C329 96 326 75 326 58C326 5 349 -12 381 -12C431 -12 482 25 519 66L504 97C480 80 448 64 432 64C419 64 410 80 422 154C432 217 450 303 466 365C499 365 536 366 546 369C557 384 570 409 583 449Z"></path></g><g transform="matrix(.0065,0,0,-0.0065,14.694,-9.852)"><path d="M503 164C503 183 495 209 482 228C429 236 402 243 352 269C402 295 428 304 482 311C494 330 503 355 502 376C485 385 459 391 436 390C403 347 383 328 336 298C338 354 343 381 365 431C353 451 337 471 318 481C302 471 283 451 273 431C293 381 300 354 303 298C255 328 235 346 202 389C179 390 153 385 135 374C135 355 143 329 156 310C210 302 236 295 286 269C236 243 210 234 156 227C144 208 135 183 136 162C153 153 180 147 202 148C235 191 255 210 303 240C300 184 295 157 274 107C285 87 301 67 320 57C337 67 355 87 365 107C345 157 339 184 336 240C383 210 404 192 436 149C460 148 485 153 503 164Z"></path></g><g transform="matrix(.013,0,0,-0.013,19.866,0)"><path d="M300 -147C201 -63 143 98 143 270S200 602 300 686L282 710C136 610 70 450 70 271V270C70 89 136 -72 282 -170L300 -147Z"></path></g><g transform="matrix(.013,0,0,-0.013,24.364,0)"><path d="M352 391C352 416 319 448 267 448C236 448 173 423 147 400C107 364 96 332 96 304C96 248 143 210 193 181C241 153 258 124 258 100C258 72 232 38 184 38C151 38 107 66 81 108C77 114 64 116 55 111C34 99 23 84 23 65C23 29 81 -12 134 -12C220 -12 325 61 325 141C325 184 297 215 234 256C194 282 161 309 161 346C161 380 188 401 217 401C255 401 279 380 301 353C308 344 313 341 325 347C341 355 352 371 352 391Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,29.174,3.132)"><path d="M329 433H203L239 587L230 596L147 534L123 433H57L30 395L34 388H115L61 129C37 16 59 -12 85 -12C147 -12 222 58 260 98L241 125C212 95 160 62 144 62C132 62 127 71 138 126L192 386L305 394L329 433Z"></path></g><g transform="matrix(.013,0,0,-0.013,32.842,0)"><path d="M275 270C275 450 212 609 64 710L45 686C145 604 203 442 203 270S147 -63 45 -147L64 -170C213 -68 275 89 275 270Z"></path></g></svg></td></tr><tr><td class="align_left">(9)   end</td></tr><tr><td class="align_left">(10)   for <i>j</i> = 1, 2, …, <i>H</i> do</td></tr><tr><td class="align_left">(11)   sample batch size sample data to optimize the objective function, update the actor network</td></tr><tr><td class="align_left">(12)   end</td></tr><tr><td class="align_left">(13)   Synchronize the parameters of two actor networks, i.e., <svg height="12.584pt" id="M152" style="vertical-align:-3.29567pt" version="1.1" viewbox="-0.0498162 -9.28833 52.2753 12.584" width="52.2753pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,6.149,3.132)"><path d="M262 451C169 451 38 375 38 210C38 98 121 -12 261 -12C360 -12 481 64 481 227C481 351 386 451 262 451ZM249 414C337 414 385 320 385 204C385 70 333 25 272 25C189 25 134 117 134 241C134 355 189 414 249 414Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,10.808,3.132)"><path d="M244 0V30C178 35 170 41 170 107V710C136 697 72 680 18 675V647C83 642 89 639 89 572V107C89 43 80 35 16 30V0H244Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,14.697,3.132)"><path d="M526 56L493 58C457 61 448 68 448 119V710C411 697 344 682 288 675V646C362 641 367 639 367 573V439C344 448 315 451 300 451C164 451 40 342 40 202C40 61 147 -12 227 -12C239 -12 266 -7 305 16L367 53V-12C427 10 505 21 526 26V56ZM367 88C342 69 306 54 271 54C207 54 133 111 133 229C133 373 219 409 264 409C301 409 342 394 367 359V88Z"></path></g><g transform="matrix(.013,0,0,-0.013,23.767,0)"><path d="M885 230V280H158L260 427L238 442C164 361 93 290 53 255C93 220 164 149 238 68L260 83L158 230H885Z"></path></g><g transform="matrix(.013,0,0,-0.013,33.677,0)"><path d="M567 230V280H69V230H567Z"></path></g><g transform="matrix(.013,0,0,-0.013,45.654,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g></svg></td></tr><tr><td class="align_left">(14)  end</td></tr><tr><td class="align_left">(15) end</td></tr><tr class="table-tr"><td colspan="1"><hr class="tbody-hr"/></td></tr></table></td></tr></table>

<div>Training process of the GMOM model.</div>

Mobile Information Systems

tab2

Table 2

Table 2: GMOM: An Offloading Method of Dependent Tasks Based on Deep Reinforcement Learning