Novel Features for plagiarism detection in Marathi Language

Authors(3) :-Ramesh R. Naik, Maheshkumar B. Landge, C. Namrata Mahender

Plagiarism is stealing information or idea from someone without giving proper acknowledgement. Currently plagiarism is increasing in different fields like education, industry. There is need to prevent plagiarism. There are four types of stylometric features available namely: lexical, syntactic, semantic, content specific. In this paper we have added three new features for detecting plagiarism namely noun, adjective and rhyming words. For calculating these features, we have used our own Marathi text corpus. These features will be useful for detecting plagiarism and linguistic researchers.

Authors and Affiliations

Ramesh R. Naik
Department of CS and IT, Dr.B.A.M.University, Aurangabad, Maharashtra, India
Maheshkumar B. Landge
Department of CS and IT, Dr.B.A.M.University, Aurangabad, Maharashtra, India
C. Namrata Mahender
Department of CS and IT, Dr.B.A.M.University, Aurangabad, Maharashtra, India

Plagiarism Detection, Feature Extraction, Stylometric Features.

  1. Chris Park. Rebels without a Clause: Towards an Institutional Framework for Dealing with Plagiarism by Students. Journal of Further and Higher Education Vol. 28, No. 3, August 2004.
  2. Alzahrani, S.M., Salim, N. and Abraham, A. (2012), "Understanding plagiarism linguistic patterns, textual features, and detection methods", IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, Vol. 42 No. 2, pp. 133-149.
  3. Chow, T.W.S. and Rahman, M.K.M. (2009), "Multilayer SOM with tree-structured data for efficient document retrieval and plagiarism detection", IEEE Transactions on Neural Networks, Vol. 20 No. 9, pp. 1385-1402.
  4. Ramya, L. and Venkatalakshmi, R. (2013), "Intelligent plagiarism detection", International Journal of Research in Engineering & Advanced Technology, Vol. 1 No. 1, pp. 171-174.
  5. K. Leilei, Q. Haoliang, W. Shuai, D. Cuixia, W. Suhong, and H. Yong, "Approaches for candidate document retrieval and detailed comparison of plagiarism detection," Notebook for PAN at CLEF 2012.
  6. M. Sanchez-Perez, G. Sidorov, and A. Gelbukh, "A winning approach to text alignment for text reuse detection at pan 2014," Notebook for PAN at CLEF, pp. 1004–1011, 2014.

Publication Details

Published in : Volume 3 | Issue 1 | January-February 2018
Date of Publication : 2018-02-28
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 851-853
Manuscript Number : CSEIT1831192
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Ramesh R. Naik, Maheshkumar B. Landge, C. Namrata Mahender, "Novel Features for plagiarism detection in Marathi Language", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.851-853, January-February-2018. |          | BibTeX | RIS | CSV

Article Preview