Intelligent Essay Grading System using Hybrid Text Processing Techniques
DOI:
https://doi.org/10.32628/CSEIT206547Keywords:
Enhanced Latent Semantic Analysis (ELSA), Essay Grading System (EGS), Inverse Document Frequency (IDF), Natural Language Processing (NLP), Part of Speech (POS) Tagging.Abstract
Educational Institutions are facing enormous tasks of marking and grading students at the end of every examination within the shortest possible time. Marking theoretical essay questions which involves thousands of examinees can be biased, subjective and time-consuming, leading to variation in grades awarded by different human assessors. This study presents an Essay Grading System called Intelligent Natural Language Processing Essay Grading System (iNLPEGS) with high accuracy percentage and minimal loss function for scoring assessment that can accommodate more robust questions. Secondary dataset collected from Kaggle provided by The Hewlett Foundation was used to aid semantic analysis and Part of Speech tagging. Assemblage of Computer Science questions and answers were collected from Babcock University Computer Science Department to create a more robust dataset to ensure high reliability. An Intelligent Natural Language Processing Essay Grading Model was designed based on Enhanced Latent Semantic Analysis using Part of Speech n-gram Inverse Document Frequency. Web based application was developed using Django, Gensim, Jupyter Notebook and Anaconda as the development tools due to availability of several Python libraries with SQLite as the database. Results of performance evaluation of iNLPEGS showed accuracy of 89.03% and error of 10.97% connoting that there is very little difference between scores from the developed intelligent essay grading system and a human grader. Also, the loss function from Root Mean Square Error (RSME) showed value of 0.620 which is very small and thus signifies closeness to the line of best fit from the regression equation.
References
- Attali and J. C. Burstein (2006), “Automated Essay Scoring with e-rater V.2”, The Journal of Technology, Learning and Assessment (JTLA), 4(3).
- Catarina and R. Bernardete (2003), “The importance of stop word removal on recall values in text categorization”, In Neural Networks, 2003. Proceedings of the International Joint Conference on, IEEE 3, 1661–1666.
- Harika, , I. S. Latha, Sai V.L., P.S. Krishna and M. Suneetha (2017), “Automated Essay Grading using Feature Selection”, International Research Journal of Engineering and Technology (IRJET) 4(3).
- Hassan, F. Miriam, H. Yulan and A. Harith, (2014), “On stop words, filtering and data sparsity for sentiment analysis of twitter”. In Proceeding Irec conf 2014.
- Hearst (2000), “The debate on Automated Essay Grading”, IEEE Intelligent Systems, 15(5), 2237, IEEE CS Press.
- W. Jonathan and K. Chunyu (1992), “Tokenization as the initial phase in NLP”, In Proceedings of the 14th conference on Computational Linguistics-Volume 4. Association for Computational Linguistics, 1106–1110.
- Khare, (2018), “Applications of Natural Language Processing for Businesses in 2018”, Available at: https://www.upgrad.com/blog/5-applications-of-natural-language-processing-forbusinesses-in-2018/
- B. Page, (1966), “The Imminence of Grading Essays by Computer”, Phi Delta Kappan, 48:238-243.
- Rouse, (2017), “Natural Language Processing (NLP)? Search Business Analytics”, Available: https://searchbusinessanalytics.techtarget.com/definition/natural-languageprocessing-NLP.
- The Hewlett Foundation (2012): Automated Essay Scoring. Available at: https://www.kaggle.com/c/asap-aes
- TwinWord (2018). 4 Applications of Natural Language Processing. Available at: https://www.twinword.com/blog/some-common-applications-of-natural-languageprocessing/
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.