Principal Component Analysis and Support Vector Machine approach for Gujarati Handwritten Numeral Recognition

Authors

  • Dr. Mrs. Mamta Jagdish Baheti  Department of Computer Science, Hislop College, Nagpur, Maharashtra, India

Keywords:

Principal Component Analysis, Support Vector Machine, Gujarati, Handwritten, Numeral Recognition

Abstract

In this paper we have proposed an algorithm for recognition of handwritten Gujarati Numerals. While reviewing the reported work, it was found that Gujarati is used all across globe including India. The proposed algorithm is applied to noisy numerals. In the algorithm we have used invariant moments as feature extraction technique and PCA and SVM as classifiers. We compared the results for both the classifiers and found that for our database SVM gave 90.55% which were better results as compared to by PCA 80.6% for Invariant moments as feature extraction technique. These results can be improved over good quality images.

References

  1. Luiz Eduardo Soares Oliveira (2003) “Automatic Recognition of Handwritten Numerical Strings” Ph. D Theses
  2. G. Nagy.(2000) “Twenty years of document image analysis in PAMI”, IEEE Trans. On Pattern Analysis and Machine Intelligence, 22(1):38–62.
  3. C.Y. Suen, M. Berthod, and S. Mori (2003) “Automatic Recognition of Hand printed Characters – The State of The Art,” Proc. IEEE, vol. 68, no. 4, pp. 469-487.
  4. J. Mantas (1986) “An Overview of Character Recognition Methodologies,” Pattern Recognition, vol. 19, no. 6, pp. 425-430.
  5. V.K. Govindan and A.P. Shivaprasad (1990) “Character Recognition – A Review,” Pattern Recognition, vol. 23, no. 7, pp. 671-683.
  6. S. Mori, C.Y. Suen, and K. Yamamoto, “Historical Review of OCR Research and Development,” Proc. IEEE, vol. 80, no. 7, pp. 1029-1058, Jul. 1992.
  7. H. Bunke and P.S.P. Wang (eds.), Handbook of Character Recognition and Document Image Analysis, World Scientiļ¬c Publishing, Singapore, 1997.
  8. S. Antani, L. Agnihotri, “Gujarati character recognition”, in: Document Analysis and Recognition, 1999. ICDAR’99. Proceedings of the Fifth International Conference on, IEEE, pp. 418 – 421.
  9. Prof S K Shah, A Sharma, “Design and implementation of optical character recognition system to recognize gujarati script using template matching",IE(I) Journal ET, Vol 86 , (January 2006).
  10. Hetal R. Thaker, Vivek J. Vyas, and Vaishali G. Sanghvi (2016) “Classification and Recognition of Gujarati Numeral using Nearest Neighbour Approach”, International Journal of Scientific Research in Science, Engineering and Technology, vol. 2(3), pp 49-52.
  11. A. R. Vasant, “Performance Evaluation of Different Image Sizes for Recognizing Offline Handwritten Gujarati Digits using Neural Network Approach,” in 2012 International Conference on Communication Systems and Network Technologies, 2012, pp. 271– 274.
  12. Mukesh M. Goswami, Suman K. Mitra, “Offline handwritten Gujarati numeral recognition using low- level strokes”, International Journal of Applied Pattern Recognition , volume 2 issue 4, DOI: http://dx.doi.org/10.1504/IJAPR.2015.075955
  13. A. K. Sharma, D. M. Adhyaru, T. H. Zaveri and P. B. Thakkar, "Comparative analysis of zoning based methods for Gujarati handwritten numeral recognition," 2015 5th Nirma University International Conference on Engineering (NUiCONE), Ahmedabad, 2015, pp. 1-5. doi: 10.1109/NUICONE.2015.7449632
  14. R. Nagar and S. K. Mitra, "Feature extraction based on stroke orientation estimation technique for handwritten numeral," Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on, Kolkata, 2015, pp. 1-6. doi: 10.1109/ICAPR.2015.7050654
  15. A. N. Vyas and M. M. Goswami, "Classification of handwritten Gujarati numerals," Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on, Kochi, 2015, pp. 1231-1237. doi: 10.1109/ICACCI.2015.7275781
  16. Mandar Chaudhary,Gitam Shikkenawis, Suman K. Mitra Mukesh Goswami, “Similar looking gujarati printed character recognition using locality preserving projection and artificial neural networks",Third International Conference on Emerging Applications of Information Technology (EAIT),IEEE,(2012), 978-1-4673-1827-3/12.
  17. Mukesh Goswami and Suman Mitra, “Structural feature based classification of printed gujarati characters",Springer-Verlag Berlin Heidelberg,(2013)LNCS 8251, P. Maji et al. (Eds.): PReMI 2013, LNCS 8251, pp. 82–87, 2013.
  18. Lipi Shah, Ripal Patel, Shreyal Patel, Jay Maniar, “Handwritten Character Recognition using Radial Histogram",International Journal of Research in Advent Technology,(2014) Vol.2, No.4, E-ISSN: 2321-9637.
  19. Hetal R. Thaker , C. K. Kumbharana, “Structural feature extraction to recognize some of the offine isolated handwritten Gujarati characters using decision tree classifier", International Journal of Computer Applications(0975 – 8887),(August 2014) Volume 99 – No.15
  20. M. K. Hu, “Visual Pattern Recognition by Moment Invariants”, IRE Trans. On Information Theory, Vol. IT-8, pp. 179-187, 1961.
  21. Trier O D, Jain A K, Taxt T(1996) Feature extraction methods for character recognition- a survey. Pattern Recognition 29:641-662.
  22. Dhandra BV, Hangarge M, Hegadi R, Malemath V S (2006) Handwritten Script Identification Using Fuzzy K-Nearest Neighbor. Proc. of IEEE 1st ICSIP II: 587-591.
  23. Andrews M J (1999) Moment Representation of blobs in 2-D Intensity Images, http//wwwgweepnet/~rocko/Moment/ Accessed 15 June 2007
  24. Ramteke R.J., Borkar P.D. and Mehrotra; “Recognition of isolated Marathi Handwritten Numerals: An Invariant Moments Approach” Proceedings of the International Conference on Cognition and Recognition 2005, pp 482-489
  25. Smith L. I.. A Tutorial on Principal Components Analyzing – Discussion and singular value decomposition, http://www.cs.otago.ac.nz/cosc453/ student_tutorials/principal_components.pdf (2002).
  26. Shlens J. A Tutorial on Principal Component Analysis, available at: http://www.cs.princeton.edu/picasso/mats/PCA-Tutorial-Intuition _jp.pdf (2005).
  27. Jolliffe I. T. Principal Component Analysis, Springer Series in Statistics, 2nd ed., Springer. (2002)..
  28. Kim K. Face Recognition using Principle Component Analysis, DCS, University of Maryland. (2003).
  29. V. Vapnik, “The Nature of Statistical Learning Theory” Springer Verlag, 1994.
  30. C. Burges, “A Tutorial on support Vector machines for pattern recognition” Data mining and knowledge discovery, vol.2,1998,pp.1-43.
  31.  V.N. Vapnik, Statistical Learning Theory. John Wiley and sons, 1998..
  32. http://sybil.ece.ucsb.edu/pages/poorman3dreg/index.html
  33. Nello Cristianini and John Shawe-Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods”, Cambridge University Press, 2000.
  34. B. Schölkopf, Support Vector Learning., Ph.D. Thesis, R. Oldenbourg Verlag Publications, Munich, Germany, 1997.
  35. Prasad, J.R. ; Kulkarni, U.V. ; Prasad, R.S. (2009) Template Matching Algorithm for Gujarati Character Recognition In Proc. Of 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET), pp 263 - 268
  36. Desai A A (2010) Gujarati handwritten numeral optical character reorganization through neural network. Pattern Recognition 43: 2582-2589.
  37. Desai A. A. (2010) Handwritten Gujarati Numeral Optical Character Recognition using Hybrid Feature Extraction Technique. IPCV pp 733-739

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
Dr. Mrs. Mamta Jagdish Baheti, " Principal Component Analysis and Support Vector Machine approach for Gujarati Handwritten Numeral Recognition, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.2139-2147, March-April-2018.