Novel Features for Plagiarism Detection in Marathi Language

Authors

  • Ramesh R. Naik, Department of CS and IT, Dr. B.A.M. University, Aurangabad, Maharashtra, India
  • Maheshkumar B. Landge, Department of CS and IT, Dr. B.A.M. University, Aurangabad, Maharashtra, India
  • C. Namrata Mahender, Department of CS and IT, Dr. B.A.M. University, Aurangabad, Maharashtra, India

Keywords:

Plagiarism Detection, Feature Extraction, Stylometric Features.

Abstract

Plagiarism is the use of another person's information or ideas without proper acknowledgement. It is increasing in fields such as education and industry, so there is a need to prevent it. Four types of stylometric features are available: lexical, syntactic, semantic, and content-specific. In this paper we add three new features for detecting plagiarism: noun, adjective, and rhyming words. To calculate these features, we use our own Marathi text corpus. These features will be useful for plagiarism detection and for linguistic researchers.
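
The abstract does not say how the three features are computed, so the following Python sketch is only one plausible reading, not the authors' method. It assumes that noun and adjective membership comes from word lists (the `nouns` and `adjectives` arguments are hypothetical stand-ins for a Marathi part-of-speech tagger or lexicon), and that two words "rhyme" when their last `suffix_len` Unicode code points match, which is a rough approximation for Devanagari text. The function names `tokenize`, `rhyme_groups`, and `extract_features` are illustrative.

```python
from collections import defaultdict

# Punctuation stripped from token edges, including the Devanagari danda.
PUNCTUATION = "।॥.,;:!?\"'()"


def tokenize(text):
    """Split on whitespace and strip surrounding punctuation."""
    tokens = []
    for raw in text.split():
        word = raw.strip(PUNCTUATION)
        if word:
            tokens.append(word)
    return tokens


def rhyme_groups(tokens, suffix_len=2):
    """Group distinct words by their final `suffix_len` code points.

    Assumption: a shared ending approximates rhyme. Only groups with
    at least two distinct words are returned.
    """
    groups = defaultdict(set)
    for word in tokens:
        if len(word) > suffix_len:
            groups[word[-suffix_len:]].add(word)
    return {suffix: words for suffix, words in groups.items() if len(words) > 1}


def extract_features(text, nouns, adjectives, suffix_len=2):
    """Return noun ratio, adjective ratio, and rhyming-word count.

    `nouns` and `adjectives` are hypothetical word sets standing in
    for a real Marathi POS tagger.
    """
    tokens = tokenize(text)
    if not tokens:
        return {"noun_ratio": 0.0, "adjective_ratio": 0.0, "rhyming_words": 0}
    total = len(tokens)
    groups = rhyme_groups(tokens, suffix_len)
    return {
        "noun_ratio": sum(t in nouns for t in tokens) / total,
        "adjective_ratio": sum(t in adjectives for t in tokens) / total,
        "rhyming_words": sum(len(words) for words in groups.values()),
    }


if __name__ == "__main__":
    # Toy example with hand-picked word lists (illustrative only).
    sample = "छोटा मुलगा खोटा।"
    print(extract_features(sample, {"मुलगा"}, {"छोटा", "खोटा"}))
```

Feature vectors of this kind could then be compared between a suspicious document and candidate sources; how the comparison is done is outside what the abstract states.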

Published

2018-02-28

Section

Research Articles

How to Cite

[1]
Ramesh R. Naik, Maheshkumar B. Landge, C. Namrata Mahender, "Novel Features for Plagiarism Detection in Marathi Language", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN: 2456-3307, Volume 3, Issue 1, pp. 851-853, January-February 2018.