Toxic Word Analyzer

Authors

  • Dhairya Timbadia  Computer Engineering, Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT2173123

Keywords:

Toxicity, Tokenization, Speedometer

Abstract

In this generation social media has been a huge part of our lives and there is no need to say that the current generation spend a huge amount of time on their social media accounts. Apart from there being a good social media influencer there are a lot of people who spread hatred among these influencers as well as among each other. I have tried to make a speedometer which would be able to tell the toxicity of the words that are basically used in the input sentence or paragraph. The main processing that would be done on the sentence or the paragraph would be removing punctuation marks, tokenization on the words, ‘Stop’ word removal, bigram creation, matching tokens with predefined dictionary, generating toxicity percent using scaling.

References

  1. Thedora Chu, Max Wang, Kylie Jue. “Comment Abuse Classification with Deep Leraning.” Stanford University.
  2. Karthik Diankar, Roi Riechart, Henry Lieberman. “Modeling the Detection of Textual Cyberbullying.” Massachusetts Institute of Technology, Cambridge MA 02139 USA.
  3. Xin Wang, Yuanchao Li, Chengjie Sun, Baoxum Wang and Xialong Wang. “Polarities of Tweets by Composing Word Embeddings with Long Short Term Memory.” 7th International Joint Conference of Natural Language Processing. July-2005.
  4. S. V. Georgakopoulus, A. G. Vrahatis, S. K. Tasoulis, V. P. Plagianakos. “Convolutional Neural Networks for Toxic Comment Classification.” arXiv:1802.099574v1 cs.CL], 27 Feb 2018.
  5. Kevin Khieu, Neha Narwal. “Detecting and Classifying Toxic Comments.” Stanford University- CSS224N.
  6. C. Nobata, J.Tetreault,A. Thomas, Y. Mehdad and Y.Chang. “Abusive language detection in online user content.”

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Dhairya Timbadia, " Toxic Word Analyzer, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 3, pp.578-581, May-June-2021. Available at doi : https://doi.org/10.32628/CSEIT2173123