Pre-Processing Concepts and Techniques for Sentiment Analysis

Authors(2) :-M. Edison, Dr. A. Aloysius

Sentiment Analysis is consider as a big task to analyse people’s opinion, appraisal, and attitudes in the worldly communications. Many of the people can express their emotions with the text, symbols, and variety of ambiguous data through social media networks. Mainly, Twitter permits a 140-character limit to post one’s comments. Therefore, users are posting their comments like ambiguous data. In that case, pre-processing techniques are very helpful to remove the unwanted data from the data set and solve the various research problems in sentiment analysis for supporting the same. This paper mainly deals with the importance of pre-processing concepts and techniques. Especially, pre-processing techniques are given an idea that cautious to select the suitable feature to analyse the sentiments, which gives better result to classify the sentiment words.

Authors and Affiliations

M. Edison
Computer Science, St. Joseph's College (Autonomous), Tiruchirappalli, TamilNadu, India
Dr. A. Aloysius
Assistant Professor, Computer Science, St. Joseph's College (Autonomous), Tiruchirappalli, TamilNadu, India

Pre-Processing, Pre-Processing Tasks, Techniques, Sentiment Analysis, Feature Selection

  1. Vijayarani, Ilamathi and Nithya “Preprocessing Techniques for Text Mining An Overview”, International Journal of Computer Science & Communication Networks (IJCSCN), 2015, pp: 7-16.
  2. Muskan and Dr. Knawal Garg “An Efficient Algorithm for Data Cleaning of Web Logs with spider Navigation Removal”, International Journal of Computer Application (IJCA), 2016, pp: 6-12.
  3. Akrivi Krouska, Christos Troussas and Maria Virvou “The effect of preprocessing techniques on Twitter Sentiment Analysis”, Information Intelligence Systems & Applications (IISA), 7th International Conference on. IEEE, 2016, pp: 1-5.
  4. R. Akila, R. Praveena and PriyaDarsini “Twitter Data Preprocessing Using Natural Language Processing”, South Asian Journal of Engineering and Technology (SAJET), 2017, pp: 46-49.
  7. L. Sunitha, M. Bal Raju and B.Sunil Srinivas “A Comparative Study between Noisy Data and Outlier Data in Data Mining”, International Journal of Current Engineering and Technology (IJCET), 2013, pp: 575-577.
  8. Jaideepsinh K. Raulji, Jatinderkumar R. Saini and Dr. Babasaheb Ambedkar “Stop-Word Removal Algorithm and its Implementation for Sanskrit Language”, International Journal of Computer Applications, 2016, pp: 15-17.
  10. S.S. Baskar , Dr. L. Arockiam and S.Charles “A Systematic Approach on Data Pre-processing In Data Mining”, COMPUSOFT, An international journal of advanced computer technology, 2013, pp: 335-339.
  11. M. Edison and A. Aloysius ‘Lexicon based Acronyms and Emoticons Classification of Sentiment Analysis (SA) on Big Data”, International Journal of Database Theory and Application (IJDTA), 2017, pp: 41-54.

Publication Details

Published in : Volume 2 | Issue 5 | September-October 2017
Date of Publication : 2017-10-31
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 141-144
Manuscript Number : CSEIT172524
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

M. Edison, Dr. A. Aloysius, "Pre-Processing Concepts and Techniques for Sentiment Analysis", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 5, pp.141-144, September-October-2017.
Journal URL :

Article Preview