Research Article

HRNet Encoder and Dual-Branch Decoder Framework-Based Scene Text Recognition Model

Table 2

Comparison of accuracy of ablation models (%).

ModelIIIT5kSVTIC03IC13IC15SVTPCUTE80

Baseline (HRNet)91.788.493.492.278.680.280.9
Baseline + SR (Bilinear Interpolation)93.089.592.792.781.181.178.1
Baseline + SR (Bilinear Interpolation) + SAM93.092.191.993.281.783.381.2
Baseline + SR (Trans Conv2D) + SAM93.491.893.393.681.882.681.6
Proposed model93.791.393.394.382.883.183.0