Research Article

Quantum Information Protection Scheme Based on Reinforcement Learning for Periodic Surface Codes

Figure 2

In training the confrontation network, the syndrome is fed to the convolutional layer, whose output feature vector is decoded and then passed to a two-layer fully connected network that estimates the optimal action value; the two are then combined to produce the optimal function value.
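
The data flow described in the caption of Figure 2 can be sketched in plain Python. This is a minimal illustration, not the authors' implementation. It assumes that the "confrontation network" is a dueling-style architecture in which a state-value estimate and per-action advantage estimates are combined as Q = V + (A - mean(A)). It further assumes wrap-around (periodic) convolution to match the periodic surface code. The class name `DuelingSketch`, the helper names, the layer sizes, the number of actions, and the random untrained weights are all hypothetical.

```python
import random

def periodic_conv2d(grid, kernel):
    """Convolve a square grid with a square kernel using wrap-around
    (periodic) boundaries, then apply ReLU. Assumed to mirror the
    periodic boundary conditions of the code lattice."""
    n, k = len(grid), len(kernel)
    off = k // 2
    out = [[0.0] * n for _ in range(n)]
    for r in range(n):
        for c in range(n):
            s = 0.0
            for i in range(k):
                for j in range(k):
                    s += kernel[i][j] * grid[(r + i - off) % n][(c + j - off) % n]
            out[r][c] = max(0.0, s)  # ReLU
    return out

def dense(vec, weights, bias):
    """Fully connected layer: one output per weight row."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weights, bias)]

def random_matrix(rng, rows, cols, scale=0.1):
    """Small random weights (untrained; for illustration only)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)]
            for _ in range(rows)]

class DuelingSketch:
    """Hypothetical syndrome -> conv -> two fully connected layers -> Q-values."""

    def __init__(self, size, n_actions, n_filters=2, hidden=8, seed=0):
        rng = random.Random(seed)
        self.kernels = [random_matrix(rng, 3, 3) for _ in range(n_filters)]
        n_features = n_filters * size * size
        # First fully connected layer (shared hidden layer).
        self.w_hidden = random_matrix(rng, hidden, n_features)
        self.b_hidden = [0.0] * hidden
        # Second fully connected layer, split into value and advantage heads.
        self.w_value = random_matrix(rng, 1, hidden)
        self.b_value = [0.0]
        self.w_adv = random_matrix(rng, n_actions, hidden)
        self.b_adv = [0.0] * n_actions

    def q_values(self, syndrome):
        # Convolutional stage: flatten all feature maps into one vector.
        features = []
        for kernel in self.kernels:
            fmap = periodic_conv2d(syndrome, kernel)
            features.extend(v for row in fmap for v in row)
        hidden = [max(0.0, h)
                  for h in dense(features, self.w_hidden, self.b_hidden)]
        value = dense(hidden, self.w_value, self.b_value)[0]
        adv = dense(hidden, self.w_adv, self.b_adv)
        mean_adv = sum(adv) / len(adv)
        # Assumed dueling combination of the two estimates.
        return [value + a - mean_adv for a in adv]

# Toy 4x4 syndrome (1 marks a violated stabilizer) and 4 hypothetical actions.
syndrome = [[0, 1, 0, 0],
            [0, 0, 0, 0],
            [0, 0, 1, 0],
            [0, 0, 0, 0]]
net = DuelingSketch(size=4, n_actions=4)
q = net.q_values(syndrome)
best_action = max(range(len(q)), key=q.__getitem__)
print(q, best_action)
```

Subtracting the mean advantage is one common way to keep the value and advantage estimates identifiable; whether the article uses this exact combination is not stated in the caption.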