Research Article

HRNet Encoder and Dual-Branch Decoder Framework-Based Scene Text Recognition Model

Table 4

Parameter counts (in millions, M) of the ablation models during training and testing.

| Model | Parameters, training (M) | Parameters, testing (M) |
|---|---|---|
| Baseline (HRNet) | 35.564 | 35.564 |
| Baseline + SR (Bilinear Interpolation) | 35.565 | 35.564 |
| Baseline + SR (Bilinear Interpolation) + SAM | 35.568 | 35.567 |
| Baseline + SR (Trans Conv2D) + SAM | 35.573 | 35.567 |
| Proposed model (Baseline + SR (Trans Conv2D) + SAM + Independent Trans Conv2D Layers) | 37.582 | 37.576 |
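
The arithmetic behind Table 4 can be checked with a short script. The sketch below is illustrative only. It copies the five rows of the table and derives two quantities: the gap between training and testing parameter counts, and the test-time overhead relative to the HRNet baseline. The helper names `training_only_params` and `overhead_vs_baseline` are hypothetical. Reading the training/testing gap as "parameters used only during training" is an assumption and is not stated in the table itself.

```python
# Parameter counts in millions (M), copied from Table 4 as (training, testing).
PARAMS_M = {
    "Baseline (HRNet)": (35.564, 35.564),
    "Baseline + SR (Bilinear Interpolation)": (35.565, 35.564),
    "Baseline + SR (Bilinear Interpolation) + SAM": (35.568, 35.567),
    "Baseline + SR (Trans Conv2D) + SAM": (35.573, 35.567),
    "Proposed model": (37.582, 37.576),
}


def training_only_params(train_m: float, test_m: float) -> float:
    """Gap (M) between training and testing counts, rounded to table precision.

    Assumption: this gap corresponds to modules that are dropped at test time.
    """
    return round(train_m - test_m, 3)


def overhead_vs_baseline(test_m: float, baseline_test_m: float) -> float:
    """Relative increase (percent) in test-time parameters over the baseline."""
    return round(100.0 * (test_m - baseline_test_m) / baseline_test_m, 2)


if __name__ == "__main__":
    baseline_test = PARAMS_M["Baseline (HRNet)"][1]
    for name, (train_m, test_m) in PARAMS_M.items():
        gap = training_only_params(train_m, test_m)
        overhead = overhead_vs_baseline(test_m, baseline_test)
        print(f"{name}: training-only gap = {gap} M, test overhead = {overhead} %")
```

Under these assumptions, the training/testing gap stays at or below 0.006 M for every variant, while the proposed model carries roughly 2.0 M more test-time parameters than the baseline (about 5.7%).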