Word-Wise Tri-Lingual Script identification using K-NN and SVM

Authors: Renuka Devi B, Raghavendra Srinivas

This paper presents word-wise script identification for three languages, Kannada, Hindi and English, using K-NN and SVM classifiers. The experiments use a dataset of 6000 word images, 2000 for each language. Local Binary Pattern (LBP) features are extracted from each word image; the LBP method yields 59 features per image. K-NN and SVM classifiers are used for recognition. The best accuracy obtained is 98.38% for K-NN and 98.50% for SVM.
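The feature and classification steps described in the abstract can be illustrated with a minimal sketch. It assumes the 59 features are the bins of a uniform LBP histogram with 8 neighbours at radius 1 (58 uniform patterns plus one shared bin), which is the usual way to arrive at 59 values; the abstract does not confirm this. The function names, the neighbour ordering, the Euclidean distance and k=3 are all illustrative assumptions, and the SVM stage is not shown.

```python
from collections import Counter

# Assumed neighbourhood: 8 pixels at radius 1, clockwise from the top-left.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def transitions(code):
    """Count 0/1 changes in the circular 8-bit pattern."""
    bits = [(code >> i) & 1 for i in range(8)]
    return sum(bits[i] != bits[(i + 1) % 8] for i in range(8))


# The 58 "uniform" codes (at most two transitions) each get their own bin;
# every other code shares bin 58, giving 59 bins in total.
UNIFORM_BIN = {}
for code in range(256):
    if transitions(code) <= 2:
        UNIFORM_BIN[code] = len(UNIFORM_BIN)


def lbp_histogram(image):
    """Return a normalised 59-bin uniform LBP histogram of a 2-D grey image.

    `image` is a list of equal-length rows of grey levels; border pixels
    are skipped because they lack a full 3x3 neighbourhood.
    """
    hist = [0] * 59
    rows, cols = len(image), len(image[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            centre = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(NEIGHBOURS):
                if image[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            hist[UNIFORM_BIN.get(code, 58)] += 1
    total = sum(hist) or 1
    return [h / total for h in hist]


def knn_predict(train_features, train_labels, query, k=3):
    """Majority vote among the k training vectors nearest to `query`."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(feat, query)), label)
        for feat, label in zip(train_features, train_labels)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]
```

In use, each word image would be converted to a 59-value vector with `lbp_histogram`, and a new word would be labelled by calling `knn_predict` against the vectors of the labelled training words. An SVM would be trained on the same vectors, typically with an existing library implementation rather than hand-written code.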

Authors and Affiliations

Renuka Devi B
Computer Science Department, JSS Manjunatheshwara Institute of Under-Graduate and Post-Graduate Studies, Vidyagiri, Dharwad, Karnataka, India
Raghavendra Srinivas
Department of Computer Science, University of Horticultural Sciences, Bagalkot, Karnataka, India

Keywords: Script identification, word-wise images, document image, K-NN, SVM


Publication Details

Published in : Volume 2 | Issue 5 | September-October 2017
Date of Publication : 2017-10-31
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 601-603
Manuscript Number : CSEIT1725108
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Renuka Devi B, Raghavendra Srinivas, "Word-Wise Tri-Lingual Script identification using K-NN and SVM", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 5, pp. 601-603, September-October 2017.
Journal URL : http://ijsrcseit.com/CSEIT1725108
