Research Article

Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess

Table 2

Average number of squares per 20 steps in 200-step training.

Q-learning + SARSA    Q-learning    SARSA

860442485571