Word-Wise Tri-Lingual Script identification using K-NN and SVM

Authors

  • Renuka Devi B  Computer Science Department, JSS Manjunatheshwara Institute of Under-Graduate and Post-Graduate Studies, Vidyagiri, Dharwad, Karnataka, India
  • Raghavendra Srinivas  Department of Computer Science, University of Horticultural Sciences, Bagalkot, Karnataka, India

Keywords:

Script identification, word-wise images, document image, K-NN, SVM.

Abstract

This paper presents word-wise tri-lingual script identification using K-NN and SVM classifiers. Three languages are considered: Kannada, Hindi and English. The experiments use a dataset of 6000 word images, 2000 for each language. Local Binary Pattern (LBP) features are extracted from each word image, giving 59 features per image. K-NN and SVM classifiers are then used for recognition. The best accuracy obtained is 98.38% for K-NN and 98.50% for SVM.
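
The feature extraction and K-NN stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the 59 features come from the common "uniform" LBP variant with 8 neighbours (58 uniform patterns plus one shared bin for all other patterns), and that K-NN uses Euclidean distance with k = 3; the abstract states none of these details. The helper names and the synthetic `ramp` and `stripes` textures are hypothetical stand-ins for real word images.

```python
import math
from collections import Counter

# 8 neighbours in circular order around the centre pixel (radius 1).
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def transitions(code):
    """Count 0/1 transitions in the circular 8-bit pattern."""
    bits = [(code >> i) & 1 for i in range(8)]
    return sum(bits[i] != bits[(i + 1) % 8] for i in range(8))


# Each of the 58 uniform patterns (at most 2 transitions) gets its own bin;
# every non-uniform pattern falls into bin 58, giving 59 bins in total.
UNIFORM_CODES = [c for c in range(256) if transitions(c) <= 2]
BIN_OF = {code: index for index, code in enumerate(UNIFORM_CODES)}


def lbp_histogram(image):
    """Normalised 59-bin uniform LBP histogram of a 2-D grey image (list of rows)."""
    hist = [0.0] * 59
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            centre = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(NEIGHBOURS):
                if image[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            hist[BIN_OF.get(code, 58)] += 1
    total = sum(hist) or 1.0
    return [h / total for h in hist]


def knn_predict(train_feats, train_labels, query, k=3):
    """Majority vote among the k training vectors nearest to the query (Euclidean)."""
    nearest = sorted((math.dist(f, query), lab) for f, lab in zip(train_feats, train_labels))
    votes = Counter(lab for _, lab in nearest[:k])
    return votes.most_common(1)[0][0]


# Hypothetical synthetic textures standing in for word images.
def ramp(slope, base=0, size=8):
    return [[slope * c + base for c in range(size)] for _ in range(size)]


def stripes(offset, size=8):
    return [[255 if (r + offset) % 2 == 0 else 0 for _ in range(size)] for r in range(size)]


train_images = [ramp(1), ramp(2, base=10), stripes(0), stripes(1)]
train_labels = ["ramp", "ramp", "stripes", "stripes"]
train_feats = [lbp_histogram(img) for img in train_images]

prediction = knn_predict(train_feats, train_labels, lbp_histogram(ramp(3, base=5)))
print(prediction)
```

With real data, each word image would be converted to a grey-level array and its label would be one of the three languages. The SVM stage is not sketched here; an SVM from any standard library could be trained on the same 59-value vectors.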

Published

2017-10-31

Section

Research Articles

How to Cite

[1]
Renuka Devi B, Raghavendra Srinivas, "Word-Wise Tri-Lingual Script identification using K-NN and SVM", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN: 2456-3307, Volume 2, Issue 5, pp. 601-603, September-October 2017.