Research Article
Quantum Information Protection Scheme Based on Reinforcement Learning for Periodic Surface Codes
Figure 2
The confrontation network training inputs the syndrome to the convolutional layer, decodes the output feature vector, and then enters into the double-layer fully connected network to optimize the optimal action value, and the two combine to generate the optimal function value.