CNN-Bidirectional LSTM Based Optical Character Recognition of Sanskrit Manuscripts : A Comprehensive Systematic Literature Review

Authors

  • Bhavesh Kataria  Research Scholar, Gujarat Technological University, Ahmedabad, Gujarat, India
  • Dr. Harikrishna B. Jethva  Associate Professor, Department of Computer Engineering, Government Engineering College, Patan, Gujarat, India

DOI:

https://doi.org//10.32628/CSEIT2064126

Keywords:

Optical Character Recognition, Convolutional Neural Network, Bidirectional LSTM, Long-Short Term Memory

Abstract

Optical character recognition (OCR) is a technology that allows you to convert different types of documents or images into searchable, editable, and analyzable data. The current work on Sanskrit Character Recognition from Images of Text Documents is one of the most difficult due to similarities in the forms of unique letters, script complexity, non-forte in the representation, and a vast number of symbols. The Devanagari script is used to write the Sanskrit language. There are a variety of approaches for recognizing characters in a scanned image. The present research initiatives highlight the importance and needs of efforts in recognition of printed and handwritten documents written in Sanskrit language. This paper is aims at reviewing the state of various scripts in use including those from medieval to present era and explores the prospective of digital recognition of printed texts and thereby pointing towards futuristic trends in developing restoration software for Sanskrit scripts. Challenge is due to the number of languages and their diverse scripts. The scarcity of digitized linguistic resources makes the task a tougher one. The paper also highlights on the characteristics and challenges of recognition of scripts of Sanskrit origin. Largely the digital recognition is limited to simple numerals and isolated characters. In addition, this review article serves the purpose an optical character recognition (OCR) system that enables to analyse the word recognition and translate various types of Sanskrit documents or images into text using deep learning architectures which include Convolutional Neural Network (CNN) and Bidirectional long-short term memory (Bidirectional LSTM).

References

  1. D. Yadav, S. Sanchez-Cuadrado, and J. Morato, "Optical Character Recognition for Hindi Language Using a Neural-network Approach" Journal of Information Processing Systems, vol. 9, no. 1, pp. 117–140, Mar. 2013.
  2. Ahmed Mahdi Obaid, Hazem M. El Bakry, M.A. Eldosuky, A.I. Shehab, Handwritten Text Recognition System Based on Neural Network, International Journal of Advanced Research in Computer Science & Technology, Vol. 4, Issue 1
  3. Jan. Agravatt, Shweta & Goswami, Mukesh & Agravatt, Hardik. (2015). A New Approach for Zone Identification on Printed Gujarati Text: Vertical Bar Method. International Journal of Software & Hardware Research in Engineering. 3. 89-93.
  4. V. A. Naik and A. A. Desai, "Online handwritten Gujarati character recognition using SVM, MLP, and K-NN," in 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2017
  5. N. Prameela, P. Anjusha, and R. Karthik, "Off-line Telugu handwritten characters recognition using optical character recognition," in 2017 IEEE International conference of Electronics, Communication and Aerospace Technology (ICECA), 2017.
  6. A. Desai, "Gujarati handwritten numeral optical character reorganization through neural network," Pattern Recognition, vol. 43, no. 7, pp. 2582–2589, Jul. 2010.
  7. Chaudhary, G. Shikkenawis, S. K. Mitra, and M. Goswami, "Similar looking Gujarati printed character recognition using Locality Preserving Projection and artificial neural networks," in 2012 Third International Conference on Emerging Applications of Information Technology, 2012.
  8. Baoguang Shi, Xiang Bai, and Cong Yao. An End-toEnd Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition. IEEE transactions on pattern analysis and machine intelligence, 39(11):2298–2304, 2016.
  9. R. Rojas, Neural Networks — A Systematic Introduction (Springer, Berlin, 1996).
  10. Paul, I. J. L., Sasirekha, S., Vishnu, D. R., & Surya, K. (2019). Recognition of handwritten text using long short term memory (LSTM) recurrent neural network (RNN). doi:10.1063/1.5097522
  11. M. Schuster and K. K. Paliwal, Bidirectional recurrent neural networks, IEEE Trans. Sign. Process. 45 (1997) 2673–2681.
  12. M. Avadesh and N. Goyal, "Optical Character Recognition for Sanskrit Using Convolution Neural Networks," 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), 2018, pp. 447-452, doi: 10.1109/DAS.2018.50.
  13. R. Plamondon and S.N. Srihari. On-line and off-line handwriting recognition: A comprehensive survey. IEEE Trans. Pattern Analysis and Machine Intelligence, 22(1):63–84, 2000.\
  14. B. B. Chaudhuri. On OCR of a printed Indian script. In Digital Document Processing: Major Directions and Recent Advances, B. B. Chaudhuri (ed.), Springer-Verlag London Ltd, pages 99–119, 2007
  15. R. Smith, D. Antonova, and D.-S. Lee, "Adapting the Tesseract open source OCR engine for multilingual OCR," in Proceedings of the International Workshop on Multilingual OCR - MOCR ’09, 2009
  16. B. B. Chaudhuri, U. Pal. An OCR System to Read Two Indian Language Scripts: Bangla and Devanagari (Hindi). 0-8 186-7898-4/97 $10.00 0 1997 IEEE.
  17. Rahiman, M. Abdul, and M. S. Rajasree. "A detailed study and analysis of ocr research in south indian scripts." In Advances in Recent Technologies in Communication and Computing, 2009. ARTCom’09. International Conference on, pp. 31-38. IEEE, 2009.
  18. C. V. Jawahar, M. N. S. S. K. Pavan Kumar, S. S. Ravi Kiran. A Bilingual OCR for Hindi-Telugu Documents and its Applications. ICDAR 2003, 0-7695-1960-1/03 $17.00 © 2003, IEEE
  19. Mahmud, J.U.; Raihan, M.F.; Rahman, C.M., "A complete OCR system for continuous Bengali characters," TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region , vol.4, no., pp.1372,1376 Vol.4, 15-17 Oct. 2003 doi: 10.1109/TENCON.2003.1273141
  20. Kiran R Dahake, S R Suralkar and S P Ramteke. Article: Optical Character Recognition for Marathi Text Newsprint. International Journal of Computer Applications 62(16):11-15, January 2013
  21. Malik, S.; Khan, S.A., "Urdu online handwriting recognition," Emerging Technologies, 2005. Proceedings of the IEEE Symposium on , vol., no., pp.27,31, 18-18 Sept. 2005 doi: 10.1109/ICET.2005.1558849
  22. Brijesh Sojitra, Vishnukumar Dhakad, "Neural Network In Character Recognition Of Gujarati Script"Journal Of Information, Knowledge And Research In Computer Engineering, ISSN: 0975- 6760, Volume - 02, Issue - 02, Pp.269-272
  23. Vijaykumar, B.; Ramakrishnan, A.G., "Radial basis function and subspace approach for printed Kannada text recognition," Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on , vol.5, no., pp.V,321-4 vol.5, 17-21 May 2004 doi: 10.1109/ICASSP.2004.1327112
  24. Rajput, G.G.; Horakeri, R., "Shape descriptors based handwritten character recognition engine with application to Kannada characters," Computer and Communication Technology (ICCCT), 2011 2nd International Conference on , vol., no., pp.135,141, 15-17 Sept. 2011 doi: 10.1109/ICCCT.2011.6075175
  25. Anil R, Arjun Pradeep, Midhun E M, Manjusha K, "Malayalam Character Recognition using Singular Value Decomposition", International Journal of Computer Applications, ISSN:0975 - 8887, Volume 92 - No.12, April 2014, pp-6-11.
  26. R. Maalej and M. Kherallah, "Convolutional Neural Network and BLSTM for Offline Arabic Handwriting Recognition," 2018 International Arab Conference on Information Technology (ACIT), Werdanye, Lebanon, 2018, pp. 1-6
  27. P. M. Kamble and R. S. Hegadi, "Handwritten Marathi Character Recognition Using R-HOG Feature," Procedia Computer Science, vol. 45, pp. 266–274, 2015.
  28. Yoon, Byung-Jun. "Hidden Markov Models and their Applications in Biological Sequence Analysis." Current genomics vol. 10,6 (2009): 402-15. doi:10.2174/138920209789177575
  29. D. Yadav, S. Sanchez-Cuadrado, and J. Morato, "Optical Character Recognition for Hindi Language Using a Neural-network Approach" Journal of Information Processing Systems, vol. 9, no. 1, pp. 117–140, Mar. 2013.
  30. Agravatt, Shweta & Goswami, Mukesh & Agravatt, Hardik. (2015). A New Approach for Zone Identification on Printed Gujarati Text: Vertical Bar Method. International Journal of Software & Hardware Research in Engineering. 3. 89-93.
  31. K. R.Dahake, S. R. Suralkar, and S. P. Ramteke, "Optical Character Recognition for Marathi Text Newsprint," International Journal of Computer Applications, vol. 62, no. 16, pp. 11–15, Jan. 2013.
  32. P. M. Kamble and R. S. Hegadi, "Handwritten Marathi Character Recognition Using R-HOG Feature," Procedia Computer Science, vol. 45, pp. 266–274, 2015.
  33. S. Aghav and S. S. Paygude, "Computer Assisted Printed Character Recognition in Document Based Images," Procedia Engineering, vol. 38, pp. 3222–3227, 2012.
  34. Ahmed Mahdi Obaid, Hazem M. El Bakry, M.A. Eldosuky, A.I. Shehab, Handwritten Text Recognition System Based on Neural Network, International Journal of Advanced Research inComputer Science & Technology, Vol. 4, Issue 1 (Jan. - Mar. 2016) pp 72-77
  35. N. S. Panyam, V. L. T.R., R. Krishnan, and K. R. N.V., "Modeling of palm leaf character recognition system using transform based techniques," Pattern Recognition Letters, vol. 84, pp. 29–34, Dec. 2016.
  36. K. S. Varghese, A. James, and S. Chandran, "A Novel Tri-Stage Recognition Scheme for Handwritten Malayalam Character Recognition," Procedia Technology, vol. 24, pp. 1333–1340, 2016.
  37. V. A. Naik and A. A. Desai, "Online handwritten Gujarati character recognition using SVM, MLP, and K-NN," in 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2017.
  38. Xiaoqing Ding, Li Chen, and Tao Wu, "Character Independent Font Recognition on a Single Chinese Character," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 2, pp. 195–204, Feb. 2007
  39. S. Bag, G. Harit, and P. Bhowmick, "Recognition of Bangla compound characters using structural decomposition," Pattern Recognition, vol. 47, no. 3, pp. 1187–1201, Mar. 2014.
  40. A. A. Desai, "Gujarati handwritten numeral optical character reorganization through neural network," Pattern Recognition, vol. 43, no. 7, pp. 2582–2589, Jul. 2010.
  41. N. Prameela, P. Anjusha, and R. Karthik, "Off-line Telugu handwritten characters recognition using optical character recognition," in 2017 International conference of Electronics, Communication and Aerospace Technology (ICECA), 2017.
  42. O. Surinta, M. F. Karaaba, L. R. B. Schomaker, and M. A. Wiering, "Recognition of handwritten characters using local gradient feature descriptors," Engineering Applications of Artificial Intelligence, vol. 45, pp. 405–414, Oct. 2015.
  43. S. M. Obaidullah, A. Mondal, and K. Roy, "Structural feature based approach for script identification from printed Indian document," in 2014 International Conference on Signal Processing and Integrated Networks (SPIN), 2014.
  44. Chaudhary, G. Shikkenawis, S. K. Mitra, and M. Goswami, "Similar looking Gujarati printed character recognition using Locality Preserving Projection and artificial neural networks," in 2012 Third International Conference on Emerging Applications of Information Technology, 2012.
  45. R. Sarkhel, N. Das, A. Das, M. Kundu, and M. Nasipuri, "A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts," Pattern Recognition, vol. 71, pp. 78–93, Nov. 2017.
  46. J. Singh, G. Singh, R. Singh, and P. Singh, "Morphological evaluation and sentiment analysis of Punjabi text using deep learning classification," Journal of King Saud University - Computer and Information Sciences, Apr. 2018.
  47. K.S.Sesh Kumar, Anoop M. Namboodiri and C. V. Jawahar - Learning Segmentation of Documents with Complex Scripts, 5th Indian Conference on Computer Vision, Graphics and Image Processing, Madurai, India LNCS 4338 pp.749-760, 2006.
  48. Sachin Rawat, K. S. Sesh Kumar, Million Meshesha, Indineel Deb Sikdar, A. Balasubramanian and C. V. Jawahar - A Semi-Automatic Adaptive OCR for Digital Libraries, Proceedings of Seventh IAPR Workshop on Document Analysis Systems, 2006 (LNCS 3872), pp 13-24
  49. Sameer Antani and Lalitha Agnihotri. 1999. Gujarati Character Recognition. In Proceedings of the Fifth International Conference on Document Analysis and Recognition (ICDAR '99). IEEE Computer Society, Washington, DC, USA
  50. Narang, S., Jindal, M.K. & Kumar, M. Sādhanā (2019) 44: 141. https://doi.org/10.1007/s12046-019-1126-9
  51. Ahmed, S. B., Naz, S., Razzak, M. I., Yusof, R., & Breuel, T. M. (2016). Balinese Character Recognition Using Bidirectional LSTM Classifier. Advances in Machine Learning and Signal Processing, 201–211. doi:10.1007/978-3-319-32213-1_18
  52. K. R.Dahake, S. R. Suralkar, and S. P. Ramteke, "Optical Character Recognition for Marathi Text Newsprint," International Journal of Computer Applications, vol. 62, no. 16, pp. 11–15, Jan. 2013.
  53. S. Aghav and S. S. Paygude, "Computer Assisted Printed Character Recognition in Document Based Images," Procedia Engineering, vol. 38, pp. 3222–3227, 2012.- Mar. 2016) pp 72-77
  54. K. S. Varghese, A. James, and S. Chandran, "A Novel Tri-Stage Recognition Scheme for Handwritten Malayalam Character Recognition" Procedia Technology, vol. 24, pp. 1333–1340, 2016.
  55. J. Singh, G. Singh, R. Singh, and P. Singh, "Morphological evaluation and sentiment analysis of Punjabi text using deep learning classification," Journal of King Saud University - Computer and Information Sciences, Apr. 2018.
  56. K.S.Sesh Kumar, Anoop M. Namboodiri and C. V. Jawahar - Learning Segmentation of Documents with Complex Scripts, 5th Indian Conference on Computer Vision, Graphics and Image Processing, Madurai, India LNCS 4338 pp.749-760, 2006.
  57. Bansal, Veena & Sinha, R.M.K.. (2000). Integrating knowledge sources in Devanagari text recognition system. Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on. 30. 500 - 505. 10.1109/3468.852443.
  58. Dutta, K., Krishnan, P., Mathew, M., & Jawahar, C.V. (2018). Offline Handwriting Recognition on Devanagari Using a New Benchmark Dataset. 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), 25-30.
  59. Holley, Rose. (2009). How good can it get? Analysing and improving OCR accuracy in large scale historic newspaper digitisation programs. D-Lib Magazine: The Magazine of the Digital Library Forum. 15.
  60. Saluja, R., Adiga, D., Chaudhuri, P., Ramakrishnan, G., & Carman, M.J. (2017). Error Detection and Corrections in Indic OCR Using LSTMs. 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 01, 17-22.
  61. R. A. Khan, A. Crenn, A. Meyer, and S. Bouakaz, "A novel database of children’s spontaneous facial expressions (LIRIS-CSE)," Image Vis. Comput., vols. 83–84, pp. 61–69, Mar. 2019
  62. R. Hussain, A. Raza, I. Siddiqi, K. Khurshid, and C. Djeddi, "A comprehensive survey of handwritten document benchmarks: Structure, usage and evaluation," EURASIP J. Image Video Process., vol. 2015, no. 1, p. 46, Dec. 2015.
  63. R. A. Khan, É. Dinet, and H. Konik, "Visual attention: Effects of blur," in Proc. IEEE Int. Conf. Image Process., Sep. 2011, pp. 3289–3292.
  64. S. Naz, A. I. Umar, R. Ahmad, I. Siddiqi, S. B. Ahmed, M. I. Razzak, and F. Shafait, "Urdu nastaliq recognition using convolutional–recursive deep learning," Neurocomputing, vol. 243, pp. 80–87, Jun. 2017.
  65. M. Al-Ayyoub, A. Nuseir, K. Alsmearat, Y. Jararweh, and B. Gupta, "Deep learning for Arabic NLP: A survey," J. Comput. Sci., vol. 26, pp. 522–531, May 2018.
  66. S. Long, X. He, and C. Yao, "Scene text detection and recognition : The deep learning era," 2018, arXiv:1811.04256. [Online]. Available: https://arxiv.org/abs/1811.04256
  67. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2015, pp. 1–9.
  68. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classifi cation with deep convolutional neural networks," in Proc. 25th Int. Conf. Neural Inf. Process. Syst. (NIPS), vol. 1. Red Hook, NY, USA: Curran Associates, 2012, pp. 1097–1105. [Online]. Available: http://dl.acm.org/citation.cfm?id=2999134.2999257
  69. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2016, pp. 770–778.
  70. A. Chaudhuri et al., Optical Character Recognition Systems for Different Languages with Soft Computing, Studies in Fuzziness and Soft Computing 352, Springer International Publishing AG 2017DOI 10.1007/978-3-319-50252-6_2
  71. X. Jiang, H. Bunke, and D. Widmer-Kljajo, Proc. of the Fifth International Conference on Document Analysis and Recognition, Bangalore, pp.629-632, 1999.
  72. J. Kumar, T. Kasar, and A. G. Ramakrishnan, IEEE TENCON 2007, pp.1-4, 2007
  73. S. Srihari and V.Govindaraju, Machine Vision and Applications, 2:141-153, 1989.
  74. C. Singha, N. Bhatiab, and A. Kaurc, Pattern Recognition, Vol. 41, pp. 3528-3546, 2008.
  75. Gatos, B., Ntirogiannis, K., Perantonis S.J. Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information. In Proceedings of 19th International Conference on Pattern Recognition (ICPR), pp. 1-4 (2008).
  76. R. Kasturi, L. O. Gorman, and V. Govindaraju, “Document image analysis: A primer,” Sadhana Part 1, vol. 27, pp. 3–22, 2002.
  77. HOG feature extraction ,extractHOGFeatures, Description of HOG descriptor. Matlab function command description, Link:https://in.mathworks.com/help/vision/ref/extracthogfeature s.html?s-tid=srchtitle.
  78. Rosenblatt, F.: The Perceptron: A probabilistic model or information storageand organization in the brain. Psychological Review 65, 386–408 (1958)
  79. Drucker, P.F.: The age of discontinuity; guidelines to our changing society, 1st edn. Harper and Row New York (1969).
  80. Jain, A.K., Duin, R.P.W., Mao, J.: Statistical pattern recognition: A review. IEEE Trans. Pattern Anal. Mach. Intell. 22(1) (2000) 4-37.
  81. Fukunaga, K.: Introduction to Statistical Pattern Recognition. Academic Press, 2nd edition (1990).
  82. Altwaijry, Najwa; Al-Turaiki, Isra (2020). Arabic handwriting recognition system using convolutional neural network. Neural Computing and Applications doi:10.1007/s00521-020-05070-8
  83. LeCun Y., Jackel L., Boser B., Denker J., Graf H., Guyon I., Henderson D., Howard R., and Hubbard W. Handwritten digit recognition : Applications of neural networks chipsand automatic learning. Proceedings of the IEEE, 86(11):2278–2324, 1998.

Downloads

Published

2019-03-30

Issue

Section

Research Articles

How to Cite

[1]
Bhavesh Kataria, Dr. Harikrishna B. Jethva, " CNN-Bidirectional LSTM Based Optical Character Recognition of Sanskrit Manuscripts : A Comprehensive Systematic Literature Review, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 5, Issue 2, pp.1362-1383, March-April-2019. Available at doi : https://doi.org/10.32628/CSEIT2064126