Research Article
Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess
Figure 7
Comparison of learning efficiency with and without the 2D normal distribution. (a) 2D normalization off. (b) 2D normalization on. (c) 2D normalization on, with a special board type made.