Pre-Processing Concepts and Techniques for Sentiment Analysis

Authors

  • M. Edison  Computer Science, St. Joseph's College (Autonomous), Tiruchirappalli, TamilNadu, India
  • Dr. A. Aloysius  Assistant Professor, Computer Science, St. Joseph's College (Autonomous), Tiruchirappalli, TamilNadu, India

Keywords:

Pre-Processing, Pre-Processing Tasks, Techniques, Sentiment Analysis, Feature Selection

Abstract

Sentiment Analysis is consider as a big task to analyse people’s opinion, appraisal, and attitudes in the worldly communications. Many of the people can express their emotions with the text, symbols, and variety of ambiguous data through social media networks. Mainly, Twitter permits a 140-character limit to post one’s comments. Therefore, users are posting their comments like ambiguous data. In that case, pre-processing techniques are very helpful to remove the unwanted data from the data set and solve the various research problems in sentiment analysis for supporting the same. This paper mainly deals with the importance of pre-processing concepts and techniques. Especially, pre-processing techniques are given an idea that cautious to select the suitable feature to analyse the sentiments, which gives better result to classify the sentiment words.

References

  1. Vijayarani, Ilamathi and Nithya “Preprocessing Techniques for Text Mining – An Overview”, International Journal of Computer Science & Communication Networks (IJCSCN), 2015, pp: 7-16.
  2. Muskan and Dr. Knawal Garg “An Efficient Algorithm for Data Cleaning of Web Logs with spider Navigation Removal”, International Journal of Computer Application (IJCA), 2016, pp: 6-12.
  3. Akrivi Krouska, Christos Troussas and Maria Virvou “The effect of preprocessing techniques on Twitter Sentiment Analysis”, Information Intelligence Systems & Applications (IISA), 7th International Conference on. IEEE, 2016, pp: 1-5.
  4. R. Akila, R. Praveena  and PriyaDarsini “Twitter Data Preprocessing Using Natural Language Processing”, South Asian Journal of Engineering and Technology (SAJET), 2017, pp: 46-49.
  5. https://www.techopedia.com/definition/14650/data-pre-processing.
  6. http://www.cs.ccsu.edu/~markov/ccsu_courses/datamining-3.html.
  7. L. Sunitha, M. Bal Raju and B.Sunil Srinivas “A Comparative Study between Noisy Data and Outlier Data in Data Mining”, International Journal of Current Engineering and Technology (IJCET), 2013, pp: 575-577.
  8. Jaideepsinh K. Raulji, Jatinderkumar R. Saini and Dr. Babasaheb Ambedkar “Stop-Word Removal Algorithm and its Implementation for Sanskrit Language”, International Journal of Computer Applications, 2016, pp: 15-17.
  9. http://text-analytics101.rxnlp.com/2014/10/all-about-stop-words-for-text-mining.html.
  10. S.S. Baskar , Dr. L. Arockiam and S.Charles “A Systematic Approach on Data Pre-processing In Data Mining”, COMPUSOFT, An international journal of advanced computer technology, 2013, pp: 335-339.
  11. M. Edison and A. Aloysius ‘Lexicon based Acronyms and Emoticons Classification of Sentiment Analysis (SA) on Big Data”, International Journal of Database Theory and Application (IJDTA), 2017, pp: 41-54.

Downloads

Published

2017-10-31

Issue

Section

Research Articles

How to Cite

[1]
M. Edison, Dr. A. Aloysius, " Pre-Processing Concepts and Techniques for Sentiment Analysis, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 5, pp.141-144, September-October-2017.