Investigating the Effects of Hyperparameters in Quantum-Enhanced Deep Reinforcement Learning

<table class="table-group" id="tab3"><tr><td><table class="table"><tr><td class="thead-hr" colspan="5"><hr/></td></tr><tr class="thead"><td class="align_left"> </td><td class="align_left"><span style="width: 15.184ptpx;"><svg height="11.439pt" id="M65" style="vertical-align:-2.15067pt" version="1.1" viewbox="-0.0498162 -9.28833 15.184 11.439" width="15.184pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M162 -163V703H101V-163H162Z"></path></g><g transform="matrix(.013,0,0,-0.013,3.419,0)"><path d="M384 0V27C293 34 287 42 287 114V635C232 613 172 594 109 583V559L157 557C201 555 205 550 205 499V114C205 42 199 34 109 27V0H384Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.659,0)"><path d="M307 267V273L115 703H53L246 270L53 -163H115L307 267Z"></path></g></svg></span></td><td class="align_left"><span style="width: 15.184ptpx;"><svg height="11.439pt" id="M66" style="vertical-align:-2.15067pt" version="1.1" viewbox="-0.0498162 -9.28833 15.184 11.439" width="15.184pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M162 -163V703H101V-163H162Z"></path></g><g transform="matrix(.013,0,0,-0.013,3.419,0)"><path d="M241 635C89 635 35 457 35 312C35 153 89 -12 240 -12C390 -12 443 166 443 312C443 466 390 635 241 635ZM238 602C329 602 354 454 354 312C354 172 330 22 240 22C152 22 124 173 124 313S148 602 238 602Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.659,0)"><path d="M307 267V273L115 703H53L246 270L53 -163H115L307 267Z"></path></g></svg></span></td><td class="align_left"><span style="width: 15.184ptpx;"><svg height="11.439pt" id="M67" style="vertical-align:-2.15067pt" version="1.1" viewbox="-0.0498162 -9.28833 15.184 11.439" width="15.184pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M162 -163V703H101V-163H162Z"></path></g><g transform="matrix(.013,0,0,-0.013,3.419,0)"><path d="M241 635C89 635 35 457 35 312C35 153 89 -12 240 -12C390 -12 443 166 443 312C443 466 390 635 241 635ZM238 602C329 602 354 454 354 312C354 172 330 22 240 22C152 22 124 173 124 313S148 602 238 602Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.659,0)"><path d="M307 267V273L115 703H53L246 270L53 -163H115L307 267Z"></path></g></svg></span></td><td class="align_left"><span style="width: 15.184ptpx;"><svg height="11.439pt" id="M68" style="vertical-align:-2.15067pt" version="1.1" viewbox="-0.0498162 -9.28833 15.184 11.439" width="15.184pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M162 -163V703H101V-163H162Z"></path></g><g transform="matrix(.013,0,0,-0.013,3.419,0)"><path d="M241 635C89 635 35 457 35 312C35 153 89 -12 240 -12C390 -12 443 166 443 312C443 466 390 635 241 635ZM238 602C329 602 354 454 354 312C354 172 330 22 240 22C152 22 124 173 124 313S148 602 238 602Z"></path></g><g transform="matrix(.013,0,0,-0.013,9.659,0)"><path d="M307 267V273L115 703H53L246 270L53 -163H115L307 267Z"></path></g></svg></span></td></tr><tr><td class="thead-hr" colspan="5"><hr/></td></tr><tr><td class="align_left">Total number of repeated measurements</td><td class="align_center">500</td><td class="align_center">500</td><td class="align_center">500</td><td class="align_center">500</td></tr><tr><td class="align_left">Total number of measurements which gives 1</td><td class="align_center">330</td><td class="align_center">400</td><td class="align_center">350</td><td class="align_center">190</td></tr><tr><td class="align_left">Total number of measurements which gives 0</td><td class="align_center">170</td><td class="align_center">100</td><td class="align_center">150</td><td class="align_center">310</td></tr><tr><td class="align_left">Probability of getting 1 or <i>P</i> (1)</td><td class="align_center">0.66</td><td class="align_center">0.8</td><td class="align_center">0.7</td><td class="align_center">0.38</td></tr><tr><td class="align_left">Probability of getting 0 or <i>P</i> (0)</td><td class="align_center">0.34</td><td class="align_center">0.2</td><td class="align_center">0.3</td><td class="align_center">0.62</td></tr><tr><td class="align_left">Expectation value</td><td class="align_center">0.66</td><td class="align_center">0.8</td><td class="align_center">0.7</td><td class="align_center">0.62</td></tr><tr class="table-tr"><td colspan="5"><hr class="tbody-hr"/></td></tr></table></td></tr></table>

<div>The method of calculating the expectation value for action selection (this is the assumption).</div>

Quantum Engineering

tab3

Table 3

Table 3: Investigating the Effects of Hyperparameters in Quantum-Enhanced Deep Reinforcement Learning