Custom Network Quantization Method for Lightweight CNN Acceleration on FPGAs

<table class="algorithm-group"><tr><td><table class="algorithm" id="alg1"><tr><td colspan="2"><b>Input:</b> Data, quantizers, pre-trained FP network with <svg height="8.8423pt" id="M60" style="vertical-align:-0.2064009pt" version="1.1" viewbox="-0.0498162 -8.6359 11.0475 8.8423" width="11.0475pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M822 650H589L583 622C660 617 677 607 674 561C672 534 664 481 647 390L600 137H596L273 650H126L120 622C176 620 194 615 207 594C221 571 225 557 214 504L161 257C141 166 129 112 121 85C108 42 83 30 29 28L23 0H260L266 28C193 33 173 42 176 89C178 122 186 172 202 255L256 527H259L583 -8H612L690 390C708 481 720 535 728 558C744 603 756 619 816 622L822 650Z"></path></g></svg> convolutional layers</td></tr><tr><td colspan="2"><b>Output:</b> The quantized network inference model</td></tr><tr><td colspan="2">1: Add quantizers before convolution operators;</td></tr><tr><td colspan="2">2: <b>for </b><svg height="10.2124pt" id="M61" style="vertical-align:-1.576501pt" version="1.1" viewbox="-0.0498162 -8.6359 116.968 10.2124" width="116.968pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M244 607C244 633 228 655 200 655C166 655 146 618 146 594C146 564 166 546 191 546C221 546 244 574 244 607ZM222 91L209 114C184 94 148 66 133 66C127 66 124 73 130 96L201 370C213 416 211 448 191 448C162 448 88 407 29 352L42 328C73 354 104 371 114 371C120 371 119 365 115 345L53 92C32 5 45 -12 68 -12C103 -12 186 50 222 91Z"></path></g><g transform="matrix(.013,0,0,-0.013,7.181,0)"><path d="M535 323V373H52V323H535ZM535 138V188H52V138H535Z"></path></g><g transform="matrix(.013,0,0,-0.013,18.444,0)"><path d="M384 0V27C293 34 287 42 287 114V635C232 613 172 594 109 583V559L157 557C201 555 205 550 205 499V114C205 42 199 34 109 27V0H384Z"></path></g><g transform="matrix(.013,0,0,-0.013,24.684,0)"><path d="M114 412C81 412 58 388 58 355C58 321 81 297 113 297S169 321 169 355C169 388 145 412 114 412ZM95 130C70 130 46 114 46 88C46 72 54 65 59 64C93 56 121 33 121 -3C121 -41 93 -68 45 -88L56 -118C117 -99 186 -56 186 22C186 91 131 130 95 130Z"></path></g><g transform="matrix(.013,0,0,-0.013,29.827,0)"><path d="M244 607C244 633 228 655 200 655C166 655 146 618 146 594C146 564 166 546 191 546C221 546 244 574 244 607ZM222 91L209 114C184 94 148 66 133 66C127 66 124 73 130 96L201 370C213 416 211 448 191 448C162 448 88 407 29 352L42 328C73 354 104 371 114 371C120 371 119 365 115 345L53 92C32 5 45 -12 68 -12C103 -12 186 50 222 91Z"></path></g><g transform="matrix(.013,0,0,-0.013,37.008,0)"><path d="M531 71V127L115 310L531 494V550L57 335V285L531 71ZM531 -40V10H57V-40H531Z"></path></g><g transform="matrix(.013,0,0,-0.013,48.271,0)"><path d="M822 650H589L583 622C660 617 677 607 674 561C672 534 664 481 647 390L600 137H596L273 650H126L120 622C176 620 194 615 207 594C221 571 225 557 214 504L161 257C141 166 129 112 121 85C108 42 83 30 29 28L23 0H260L266 28C193 33 173 42 176 89C178 122 186 172 202 255L256 527H259L583 -8H612L690 390C708 481 720 535 728 558C744 603 756 619 816 622L822 650Z"></path></g><g transform="matrix(.013,0,0,-0.013,59.181,0)"><path d="M114 412C81 412 58 388 58 355C58 321 81 297 113 297S169 321 169 355C169 388 145 412 114 412ZM95 130C70 130 46 114 46 88C46 72 54 65 59 64C93 56 121 33 121 -3C121 -41 93 -68 45 -88L56 -118C117 -99 186 -56 186 22C186 91 131 130 95 130Z"></path></g><g transform="matrix(.013,0,0,-0.013,64.324,0)"><path d="M244 607C244 633 228 655 200 655C166 655 146 618 146 594C146 564 166 546 191 546C221 546 244 574 244 607ZM222 91L209 114C184 94 148 66 133 66C127 66 124 73 130 96L201 370C213 416 211 448 191 448C162 448 88 407 29 352L42 328C73 354 104 371 114 371C120 371 119 365 115 345L53 92C32 5 45 -12 68 -12C103 -12 186 50 222 91Z"></path></g><g transform="matrix(.013,0,0,-0.013,71.505,0)"><path d="M885 230V280H158L260 427L238 442C164 361 93 290 53 255C93 220 164 149 238 68L260 83L158 230H885Z"></path></g><g transform="matrix(.013,0,0,-0.013,81.415,0)"><path d="M567 230V280H69V230H567Z"></path></g><g transform="matrix(.013,0,0,-0.013,93.392,0)"><path d="M244 607C244 633 228 655 200 655C166 655 146 618 146 594C146 564 166 546 191 546C221 546 244 574 244 607ZM222 91L209 114C184 94 148 66 133 66C127 66 124 73 130 96L201 370C213 416 211 448 191 448C162 448 88 407 29 352L42 328C73 354 104 371 114 371C120 371 119 365 115 345L53 92C32 5 45 -12 68 -12C103 -12 186 50 222 91Z"></path></g><g transform="matrix(.013,0,0,-0.013,99.847,0)"><path d="M535 230V280H323V490H265V280H52V230H265V-3H323V230H535Z"></path></g><g transform="matrix(.013,0,0,-0.013,110.384,0)"><path d="M384 0V27C293 34 287 42 287 114V635C232 613 172 594 109 583V559L157 557C201 555 205 550 205 499V114C205 42 199 34 109 27V0H384Z"></path></g></svg><b>do</b></td></tr><tr><td colspan="2">3:   Forward propagation by <svg height="11.5564pt" id="M62" style="vertical-align:-2.26807pt" version="1.1" viewbox="-0.0498162 -9.28833 21.5257 11.5564" width="21.5257pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M699 368C699 549 574 666 407 666C186 666 23 488 23 277C23 113 129 -3 288 -13L307 -26C431 -111 501 -139 533 -147C559 -154 613 -163 658 -164L666 -141C597 -111 507 -66 430 -11L416 -1C580 42 699 190 699 368ZM601 371C601 227 518 54 381 22L354 40L278 24C175 47 120 145 120 269C120 451 235 631 398 631C540 631 601 521 601 371Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.386,0)"><path d="M300 -147C201 -63 143 98 143 270S200 602 300 686L282 710C136 610 70 450 70 271V270C70 89 136 -72 282 -170L300 -147Z"></path></g><g transform="matrix(.013,0,0,-0.013,13.884,0)"><path d="M170 255C170 288 146 313 114 313S58 288 58 255C58 221 82 198 114 198S170 221 170 255Z"></path></g><g transform="matrix(.013,0,0,-0.013,16.848,0)"><path d="M275 270C275 450 212 609 64 710L45 686C145 604 203 442 203 270S147 -63 45 -147L64 -170C213 -68 275 89 275 270Z"></path></g></svg> to weights of the network <svg height="12.1306pt" id="M63" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -11.9242 12.8091 12.1306" width="12.8091pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,1.292,-2.897)"><path d="M846 558C704 595 559 633 434 679H412C288 633 141 595 0 558L8 532L423 607L838 532L846 558Z"></path></g><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M962 650H739L732 622L760 619C812 613 819 604 798 552C760 457 671 267 606 131H604L511 638H480L237 131H233L183 554C177 607 183 614 226 619L248 622L257 650H24L17 622C88 615 92 611 103 524L173 -11H203L450 491H453L543 -11H575L839 529C882 609 886 613 953 622L962 650Z"></path></g></svg> and by <svg height="13.7042pt" id="M64" style="vertical-align:-2.2681pt" version="1.1" viewbox="-0.0498162 -11.4361 66.7771 13.7042" width="66.7771pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M699 368C699 549 574 666 407 666C186 666 23 488 23 277C23 113 129 -3 288 -13L307 -26C431 -111 501 -139 533 -147C559 -154 613 -163 658 -164L666 -141C597 -111 507 -66 430 -11L416 -1C580 42 699 190 699 368ZM601 371C601 227 518 54 381 22L354 40L278 24C175 47 120 145 120 269C120 451 235 631 398 631C540 631 601 521 601 371Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.386,0)"><path d="M300 -147C201 -63 143 98 143 270S200 602 300 686L282 710C136 610 70 450 70 271V270C70 89 136 -72 282 -170L300 -147Z"></path></g><g transform="matrix(.013,0,0,-0.013,13.884,0)"><path d="M600 480C600 590 528 650 384 650H143L137 622C222 614 225 607 210 531L130 127C113 41 106 36 23 28L17 0H294L300 28C204 36 195 42 212 127L243 284L314 263C327 263 339 263 352 264C465 271 600 337 600 480ZM508 481C508 351 402 304 329 304C289 304 265 311 250 317L295 559C302 594 310 606 323 611C335 616 350 619 367 619C455 619 508 573 508 481Z"></path></g><g transform="matrix(.013,0,0,-0.013,20.344,0)"><path d="M686 28C612 35 607 44 591 112C563 234 541 360 519 489L489 666L457 658L147 121C100 40 89 36 24 28L17 0H240L250 28C168 34 159 41 190 101L262 237H482C495 180 503 137 510 91C517 47 514 35 441 28L433 0H677L686 28ZM475 280H285L429 541H431L475 280Z"></path></g><g transform="matrix(.013,0,0,-0.013,29.218,0)"><path d="M645 631C614 643 545 666 457 666C215 666 23 519 23 283C23 90 158 -16 337 -16C412 -16 489 2 522 10C543 39 590 127 606 167L580 181C519 89 459 18 348 18C201 18 122 136 122 287C122 464 244 632 435 632C544 632 602 595 608 472L639 475C643 526 645 581 645 631Z"></path></g><g transform="matrix(.013,0,0,-0.013,37.903,0)"><path d="M620 675H597C578 656 570 650 541 650H144C112 650 104 653 94 675H72C59 618 42 552 23 493L53 491C71 534 88 564 105 585C124 608 144 615 238 615H290L197 121C182 40 174 34 88 28L82 0H361L367 28C275 34 266 38 281 121L374 615H441C522 615 543 608 553 583C562 560 566 531 565 493L597 494C603 551 612 629 620 675Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,46.188,-5.741)"><path d="M310 541L304 571C290 586 211 619 185 610L80 76L131 52L310 541Z"></path></g><g transform="matrix(.013,0,0,-0.013,49.983,0)"><path d="M300 -147C201 -63 143 98 143 270S200 602 300 686L282 710C136 610 70 450 70 271V270C70 89 136 -72 282 -170L300 -147Z"></path></g><g transform="matrix(.013,0,0,-0.013,54.481,0)"><path d="M170 255C170 288 146 313 114 313S58 288 58 255C58 221 82 198 114 198S170 221 170 255Z"></path></g><g transform="matrix(.013,0,0,-0.013,57.445,0)"><path d="M275 270C275 450 212 609 64 710L45 686C145 604 203 442 203 270S147 -63 45 -147L64 -170C213 -68 275 89 275 270Z"></path></g><g transform="matrix(.013,0,0,-0.013,61.943,0)"><path d="M275 270C275 450 212 609 64 710L45 686C145 604 203 442 203 270S147 -63 45 -147L64 -170C213 -68 275 89 275 270Z"></path></g></svg> to activations of the network <span class="nowrap"><svg height="11.974pt" id="M65" style="vertical-align:-0.04979992pt" version="1.1" viewbox="-0.0498162 -11.9242 10.0819 11.974" width="10.0819pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,1.416,-2.897)"><path d="M658 557C549 593 437 629 340 673H318C222 629 109 593 0 557L9 532C116 551 225 577 329 602C435 576 543 551 649 532L658 557Z"></path></g><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M748 650H522L515 622L546 617C580 611 587 604 565 575C518 513 469 451 419 393C376 474 349 534 330 580C318 609 325 612 361 618L383 622L392 650H151L144 622C214 616 224 612 257 543L360 327C270 218 187 124 159 95C106 40 92 34 26 28L17 0H252L259 28L236 31C189 37 188 47 209 78C249 136 308 210 377 294L478 79C494 44 487 37 449 32L418 28L409 0H673L680 28C596 34 591 39 554 116L436 361C526 469 574 521 604 553C659 612 669 614 739 622L748 650Z"></path></g></svg>;</span></td></tr><tr><td colspan="2">4:   Backward propagation by STE to update network parameters;</td></tr><tr><td colspan="2">5: <b>end for</b></td></tr><tr><td colspan="2">6: Add quantizers before non-convolution operators;</td></tr><tr><td colspan="2">7: Re-train the network and subgraph fusion;</td></tr><tr><td colspan="2">8:<b>return</b> quantized network inference model;</td></tr></table></td></tr></table>

<div> Framework of custom network quantization.</div>

International Journal of Distributed Sensor Networks

alg1

Algorithm 1

Algorithm 1: Custom Network Quantization Method for Lightweight CNN Acceleration on FPGAs