Research Article

PF-ViT: Parallel and Fast Vision Transformer for Offline Handwritten Chinese Character Recognition

Table 4

Performance of different models on the DHWDB dataset: parameters; FLOPs; accuracy.

MethodsNumber of encoder layers per channelEpochs#Params (M)FLOPs (G)Acc. (%)

T-ViT330043.114.3298.1
430057.285.7298.3
630085.628.5298.6

F-ViT230057.282.9496.6
330085.624.3697.3
6300170.638.6197.7

S-ViT230099.792.9996.3
3300148.384.4397.1
4300198.985.8697.0