Research Article

HRNet Encoder and Dual-Branch Decoder Framework-Based Scene Text Recognition Model

Table 1

The accuracy comparison between the proposed model and recent models (%).

ModelBenchmarkAverage
IIIT5kSVTIC03IC13IC15SVTPCUTE80RegularIrregular

ASTER93.489.594.591.876.178.579.592.378.0
TextSR92.587.293.291.375.677.478.991.077.3
ESIR93.390.291.776.979.683.391.779.9
2DOCR9490.194.392.776.382.386.892.781.8
Bi-STET94.7899693.475.780.682.593.279.6
SEED93.889.692.88081.483.692.081.6
DAN94.389.29593.974.58084.493.179.6
SPIN94.787.693.491.579.179.785.191.881.3
RobustScanner95.388.194.877.179.590.392.782.3
SCGAN949095.693.381.685.178.193.281.6
Proposed model93.791.393.394.382.883.183.093.182.9

Note: bold font is the optimal value in each column, and the underline font is the suboptimal value in each column.