Machine Learning Methods for Content - Classification and Categorization

Authors

  • Prabhat Kubal  Student, Department of Computer Science, S.K.Somaiya College, Somaiya Vidyavihar University, Mumbai, India
  • Prof. Surabhi Thorat  Professor, Department of Computer Science, S.K.Somaiya College, Somaiya Vidyavihar University, Mumbai, India
  • Prof. Swati Maurya  Professor, Department of Computer Science, S.K.Somaiya College, Somaiya Vidyavihar University, Mumbai, India

DOI:

https://doi.org/10.32628/CSEIT217648

Keywords:

Text Detection, Text Analysis, Content Analysis, Social Media, Education, Toxic Comment Classification

Abstract

These days online gatherings and web-based media stages have furnished people with the necessary resources to advance their contemplations and put themselves out there free paying little heed to the kind of language used to communicate those thoughts, in certain examples these internet based remarks contain express language which might hurt the peruser. We likewise evaluate the class irregularity issues related with the dataset by utilizing inspecting procedures and misfortune. Models we applied yield high in general exactness with moderately minimal expense. To diminish the adverse consequence of poisonous remark in everyday life we have endeavored to plan a Toxic Language detector.

References

  1. Maslej Kresnakova, Viera & Sarnovsky, Martin & Butka, Peter & Machova, Kristina. (2020). Comparison of Deep Learning Models and Various Text Pre-Processing Techniques for the Toxic Comments Classification. Applied Sciences. 10. 8631. 10.3390/app10238631.
  2. Maslej Krešňáková, Viera & Sarnovsky, Martin & Butka, Peter & Machova, Kristina. (2020). Comparison of Deep Learning Models and Various Text Pre-Processing Techniques for the Toxic Comments Classification. Applied Sciences. 10. 8631. 10.3390/app10238631.
  3. Pavel, Monirul. (2020). Toxic Comment Classification Implementing Natural Language Processing and Convolutional Neural Networks With Word Embedding Technique.
  4. Gupta, Akash & Nayyar, Anand & Arora, Simrann & Jain, Rachna. (2021). Detection and Classification of Toxic Comments by Using LSTM and Bi-LSTM Approach. 10.1007/978-981-16-3660-8_10.
  5. Georgakopoulos, Spiros & Tasoulis, Sotiris & Vrahatis, Aristidis & Plagianakos, Vassilis. (2020). Convolutional Neural Networks for Twitter Text Toxicity Analysis. 10.1007/978-3-030-16841-4_38.
  6. van Aken, Betty & Risch, Julian & Krestel, Ralf & Löser, Alexander. (2018). Challenges for Toxic Comment Classification: An In-Depth Error Analysis.
  7. Androcec, Darko. (2020). Machine learning methods for toxic comment classification: a systematic review. Acta Universitatis Sapientiae, Informatica. 12. 205-216. 10.2478/ausi-2020-0012.
  8. Pavel, Monirul & Razzak, Razia & Sengupta, Katha & Niloy, Dilshad Kabir & Muquith, Munim & Tan, Siok. (2021). Toxic Comment Classification Implementing CNN Combining Word Embedding Technique. 10.1007/978-981-33-4305-4_65.
  9. Rani, Pooja & Panichella, Sebastiano & Leuenberger, Manuel & Di Sorbo, Andrea & Nierstrasz, Oscar. (2021). How to Identify Class Comment Types? A Multi-language Approach for Class Comments Classification.

Downloads

Published

2021-12-30

Issue

Section

Research Articles

How to Cite

[1]
Prabhat Kubal, Prof. Surabhi Thorat, Prof. Swati Maurya, " Machine Learning Methods for Content - Classification and Categorization" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 6, pp.184-189, November-December-2021. Available at doi : https://doi.org/10.32628/CSEIT217648