Research Article

Tamil OCR Conversion from Digital Writing Pad Recognition Accuracy Improves through Modified Deep Learning Architectures

Table 1

Comparison of different methods and languages.

ReferencesModelLanguageDatasetTypeDataset sizeWriting devices

[16]CNNArabicCMATERDBDigits3,000Pen and Paper
[17]CNN BiLSTMArabicKHATTWord325 paragraphPen and Paper
[18]CNN-TLBangla and DevanagiriCMATERDB ICDAR Ph.D. Indic_11Word361 handwritten documentsPen and Paper
[19]GANChineseHIT-MW and ICDARWord1,003 handwritten imagesPen and Paper
[20]CNNTeluguOwn datasetCharacter16 characters and 21 guninthalu, total of 275,520Pen and Paper Touchpad device
ProposedRTSBATamilDWP-H and HP Lab datasetsWord251 writers total of 85,800 imagesTablet

CNN, convolutional neural network; BiLSTM, bidirectional long short-term memory; RTSBA, ResNet two-stage bottleneck architecture; DWP-H, digital writing pad-handwritten; GAN, generative adversarial network; CMATERDB, center for microprocessor applications for training education and research data base; KHATT, KFUPM Handwritten Arabic TexT; ICDAR, international conference on document analysis and recognition; HIT-MW, harbin institute of technology-multiple writers.