Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

<table class="table-group" id="tab12"><tr><td><table class="table"><tr><td class="thead-hr" colspan="5"><hr/></td></tr><tr><td class="align_left">Method</td><td class="align_center">lifetime</td><td class="align_center">Lead time</td><td class="align_center"><span style="width: 26.4842ptpx;"><svg height="8.57479pt" id="M408" style="vertical-align:-0.04981041pt" version="1.1" viewbox="-0.0498162 -8.52498 26.4842 8.57479" width="26.4842pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M861 0V28C774 35 771 41 768 147L759 509C756 612 762 614 851 622V650H681L449 149L221 650H57V622C148 613 153 609 144 479L130 271C123 166 117 123 111 88C104 46 85 34 26 28V0H259V28C192 35 169 42 167 90C166 130 166 173 170 256L185 541H187L411 7H431L675 555H679L683 147C683 41 680 35 598 28V0H861Z"></path></g><g transform="matrix(.013,0,0,-0.013,11.583,0)"><path d="M517 162C503 123 484 88 467 68C445 42 417 34 341 34C291 34 256 34 237 47C219 59 213 81 213 128V317H308C395 317 402 311 415 240H444V431H415C403 364 398 356 307 356H213V584C213 613 215 616 246 616H322C394 616 421 609 435 587C448 566 458 544 467 502L496 506C493 557 488 625 488 650H42V622C120 616 128 612 128 523V125C128 43 120 35 29 28V0H511C520 31 540 125 546 158L517 162Z"></path></g><g transform="matrix(.013,0,0,-0.013,18.967,0)"><path d="M46 650V622C120 617 128 613 128 525V125C128 42 120 34 40 28V0H311V28C221 34 212 39 212 124V281L286 262C297 261 316 261 331 263C429 275 526 338 526 468C526 533 501 579 462 609C422 638 364 650 293 650H46ZM212 559C212 588 215 600 223 606C230 613 251 618 279 618C361 618 430 572 430 464C430 337 350 302 285 302C252 302 225 309 212 314V559Z"></path></g></svg></span></td><td class="align_center"><span style="width: 30.2292ptpx;"><svg height="8.98583pt" id="M409" style="vertical-align:-0.2324905pt" version="1.1" viewbox="-0.0498162 -8.75334 30.2292 8.98583" width="30.2292pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M861 0V28C774 35 771 41 768 147L759 509C756 612 762 614 851 622V650H681L449 149L221 650H57V622C148 613 153 609 144 479L130 271C123 166 117 123 111 88C104 46 85 34 26 28V0H259V28C192 35 169 42 167 90C166 130 166 173 170 256L185 541H187L411 7H431L675 555H679L683 147C683 41 680 35 598 28V0H861Z"></path></g><g transform="matrix(.013,0,0,-0.013,11.583,0)"><path d="M43 650V622C120 616 128 612 128 526V124C128 39 120 33 34 27V0H270C392 0 492 25 567 83C643 141 690 230 690 350C690 444 655 517 605 565C543 625 450 650 323 650H43ZM213 547C213 587 217 598 226 604C236 612 262 617 304 617C371 617 429 604 474 576C554 529 592 439 592 336C592 176 505 36 319 36C246 36 213 55 213 131V547Z"></path></g><g transform="matrix(.013,0,0,-0.013,21.373,0)"><path d="M614 175C564 76 510 21 408 21C256 21 146 149 146 336C146 488 235 629 402 629C510 629 570 586 597 480L626 488C620 541 614 582 606 638C578 643 510 665 429 665C206 665 44 527 44 316C44 157 153 -15 402 -15C474 -15 558 5 586 11C604 45 629 119 643 165L614 175Z"></path></g></svg></span></td></tr><tr><td class="align_left" colspan="5"><hr/></td></tr><tr><td class="align_left" rowspan="2">PAQ-DQN</td><td class="align_center">2</td><td class="align_center">0</td><td class="align_center">5394.735</td><td class="align_center">21.519</td></tr><tr><td class="align_center"> </td><td class="align_center">1</td><td class="align_center">2761.297</td><td class="align_center">371.397</td></tr><tr><td class="align_left" rowspan="2">PAQ-A2C</td><td class="align_center">2</td><td class="align_center">0</td><td class="align_center">5367.027</td><td class="align_center">16.139</td></tr><tr><td class="align_center"> </td><td class="align_center">1</td><td class="align_center">2739.912</td><td class="align_center">330.639</td></tr><tr><td class="align_left" rowspan="2"><i>Q</i>-learning</td><td class="align_center">2</td><td class="align_center">0</td><td class="align_center">4437.220</td><td class="align_center">87.911</td></tr><tr><td class="align_center"> </td><td class="align_center">1</td><td class="align_center"><span style="width: 17.6545ptpx;"><svg height="8.98582pt" id="M410" style="vertical-align:-0.6370001pt" version="1.1" viewbox="-0.0498162 -8.34882 17.6545 8.98582" width="17.6545pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M512 -3V55L134 254V256L512 456V514L75 281V230L512 -3Z"></path></g><g transform="matrix(.013,0,0,-0.013,11.263,0)"><path d="M241 635C89 635 35 457 35 312C35 153 89 -12 240 -12C390 -12 443 166 443 312C443 466 390 635 241 635ZM238 602C329 602 354 454 354 312C354 172 330 22 240 22C152 22 124 173 124 313S148 602 238 602Z"></path></g></svg></span></td><td class="align_center">0</td></tr><tr class="table-tr"><td colspan="5"><hr class="tbody-hr"/></td></tr></table></td></tr></table>

Complexity

tab12

Table 12

Table 12: Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning