Pre-Processing Concepts and Techniques for Sentiment Analysis
Keywords:
Pre-Processing, Pre-Processing Tasks, Techniques, Sentiment Analysis, Feature SelectionAbstract
Sentiment Analysis is consider as a big task to analyse people’s opinion, appraisal, and attitudes in the worldly communications. Many of the people can express their emotions with the text, symbols, and variety of ambiguous data through social media networks. Mainly, Twitter permits a 140-character limit to post one’s comments. Therefore, users are posting their comments like ambiguous data. In that case, pre-processing techniques are very helpful to remove the unwanted data from the data set and solve the various research problems in sentiment analysis for supporting the same. This paper mainly deals with the importance of pre-processing concepts and techniques. Especially, pre-processing techniques are given an idea that cautious to select the suitable feature to analyse the sentiments, which gives better result to classify the sentiment words.
References
- Vijayarani, Ilamathi and Nithya “Preprocessing Techniques for Text Mining – An Overview”, International Journal of Computer Science & Communication Networks (IJCSCN), 2015, pp: 7-16.
- Muskan and Dr. Knawal Garg “An Efficient Algorithm for Data Cleaning of Web Logs with spider Navigation Removal”, International Journal of Computer Application (IJCA), 2016, pp: 6-12.
- Akrivi Krouska, Christos Troussas and Maria Virvou “The effect of preprocessing techniques on Twitter Sentiment Analysis”, Information Intelligence Systems & Applications (IISA), 7th International Conference on. IEEE, 2016, pp: 1-5.
- R. Akila, R. Praveena and PriyaDarsini “Twitter Data Preprocessing Using Natural Language Processing”, South Asian Journal of Engineering and Technology (SAJET), 2017, pp: 46-49.
- https://www.techopedia.com/definition/14650/data-pre-processing.
- http://www.cs.ccsu.edu/~markov/ccsu_courses/datamining-3.html.
- L. Sunitha, M. Bal Raju and B.Sunil Srinivas “A Comparative Study between Noisy Data and Outlier Data in Data Mining”, International Journal of Current Engineering and Technology (IJCET), 2013, pp: 575-577.
- Jaideepsinh K. Raulji, Jatinderkumar R. Saini and Dr. Babasaheb Ambedkar “Stop-Word Removal Algorithm and its Implementation for Sanskrit Language”, International Journal of Computer Applications, 2016, pp: 15-17.
- http://text-analytics101.rxnlp.com/2014/10/all-about-stop-words-for-text-mining.html.
- S.S. Baskar , Dr. L. Arockiam and S.Charles “A Systematic Approach on Data Pre-processing In Data Mining”, COMPUSOFT, An international journal of advanced computer technology, 2013, pp: 335-339.
- M. Edison and A. Aloysius ‘Lexicon based Acronyms and Emoticons Classification of Sentiment Analysis (SA) on Big Data”, International Journal of Database Theory and Application (IJDTA), 2017, pp: 41-54.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT
This work is licensed under a Creative Commons Attribution 4.0 International License.