Developing the Framework Using Deep Neural Network for Detection of Spam and Fake Spam Messages in Twitter
DOI:
https://doi.org/10.32628/CSEIT2410289Keywords:
Twitter Spam, Multi-classifier, Classification, Bayesian, K-Nearest neighbor, Random forestAbstract
Social media plays vital role among the user communities for social gathering, entertainment, communication, sharing knowledge so on. Twitter is one such network to connect millions of users to share information. Nowadays, there are humpteen numbers of users using social media for social engagements. Due to the fact that wide publicity of individuals and products get viral in social media, everyone wish to use social media as a platform to promote their product. Furthermore, large number of people relies on social media contents to take decisions. Twitter is one of the social media platforms to post the media contents by the user. Spammers are illegal users intrude the twitter account and send the duplicate messages to promote advertisement, phishing, scam and personal blogs etc. In this paper, a novel spam detection mechanism is introduced to detect the suspicious users on twitter. The system has been designed such a way that it initially set with semi-supervised at the tweet level and update into supervised level for learning the input tweets to detect the spammers. The proposed system will also identify the type of spammers and will also remove duplicate tweets. We have applied with multi-classifier algorithms like naïve Bayesian, K-Nearest neighbor and Random forest into twitter data set and the performance is compared. The experimental result shows very promising results.
Downloads
References
Wu, Tingmin, et al., "Twitter spam detection: Survey of new approaches and comparative study”. Computers & Security. 76.10.1016/j.cose.2017.11.013. DOI: https://doi.org/10.1016/j.cose.2017.11.013
M. Jiang, et al., "Suspicious behavior detection: Current trends and future directions," IEEE Intelligent Systems, vol. 31, pp. 31-39,2016. DOI: https://doi.org/10.1109/MIS.2016.5
J. Tanha, et al., "Semi-supervised self-training for decision tree classifiers," International Journal of Machine Learning and Cybernetics,vol. 8, pp. 355-370, 2017. DOI: https://doi.org/10.1007/s13042-015-0328-7
Y. Xia, et al., "A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring," Expert Systems withApplications, vol. 78, pp. 225-241, 2017. DOI: https://doi.org/10.1016/j.eswa.2017.02.017
S. Sedhai and A. Sun, "Semi-supervised spam detection in Twitter stream," IEEE Transactions on Computational Social Systems, vol.5, pp. 169-175, 2017. DOI: https://doi.org/10.1109/TCSS.2017.2773581
S. Liu, et al., "Addressing the class imbalance problem in twitter spam detection using ensemble learning," Computers & Security, vol.69, pp. 35-49, 2017. DOI: https://doi.org/10.1016/j.cose.2016.12.004
C. Chen, et al., "Investigating the deceptive information in Twitter spam," Future Generation Computer Systems, vol. 72, pp. 319-326,2017. DOI: https://doi.org/10.1016/j.future.2016.05.036
G. Lin, et al., "Statistical twitter spam detection demystified: performance, stability and scalability," IEEE Access, vol. 5, pp. 11142-11154, 2017. DOI: https://doi.org/10.1109/ACCESS.2017.2710540
C. Chen, et al., "Statistical features-based real-time detection of drifted twitter spam," IEEE Transactions on Information Forensics andSecurity, vol. 12, pp. 914-925, 2016. DOI: https://doi.org/10.1109/TIFS.2016.2621888
A. Singh and S. Batra, "Ensemble based spam detection in social IoT using probabilistic data structures," Future Generation ComputerSystems, vol. 81, pp. 359-371, 2018. DOI: https://doi.org/10.1016/j.future.2017.09.072
C. Li and S. Liu, "A comparative study of the class imbalance problem in Twitter spam detection," Concurrency and Computation:Practice and Experience, vol. 30, p. e4281, 2018. DOI: https://doi.org/10.1002/cpe.4281
R. Aswani, et al., "Detection of spammers in twitter marketing: a hybrid approach using social media analytics and bio inspiredcomputing," Information Systems Frontiers, vol. 20, pp. 515-530, 2018. DOI: https://doi.org/10.1007/s10796-017-9805-8
A. T. Kabakus and R. Kara, "“TwitterSpamDetector”: A Spam Detection Framework for Twitter," International Journal of Knowledgeand Systems Science (IJKSS), vol. 10, pp. 1-14, 2019. DOI: https://doi.org/10.4018/IJKSS.2019070101
X. Wang, et al., "Drifted Twitter Spam Classification Using Multiscale Detection Test on KL Divergence," IEEE Access, vol. 7, pp.108384-108394, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2932018
B., Mukunthan. (2019). Improved Content Based Medical Image Retrieval using PCA with SURF Features. International Journal ofInnovative Technology and Exploring Engineering. 8. 10.35940/ijitee.J1020.08810S19.
M.Arunkrishna, B.Mukunthan “ Review on Classification of Anti-Spam Solutions : Approaches, Algorithms Demystified.” Studies inIndian Place Names Vol. 40 No. 60 (2020): Vol-40-Issue-60-March-2020 , vol. 40, no. 60, 6 Mar. 2020, pp. 4449–4458.
Mukunthan B, Nagaveni N. Identification of unique repeated patterns, location of mutation in DNA finger printing using artificialintelligence technique. Int J Bioinform Res Appl. 2014;10(2):157‐176. doi:10.1504/IJBRA.2014.059516 DOI: https://doi.org/10.1504/IJBRA.2014.059516
A, Pushpalatha & B, Mukunthan. (2010). Automation of DNA Finger Printing for Precise Pattern Identification using Neural-fuzzyMapping approach. International Journal of Computer Applications. 12. 10.5120/1761-2411. DOI: https://doi.org/10.5120/1761-2411
V. Vishwarupe, et al., "Intelligent Twitter spam detection: a hybrid approach," in Smart Trends in Systems, Security and Sustainability,ed: Springer, 2018, pp. 189-197. DOI: https://doi.org/10.1007/978-981-10-6916-1_17
C.C. Wei and N.S. Hsu,"Derived operating rules for a reservoir operation system: Comparison of decision trees, neural decision treesand fuzzy decision trees,"Water resources research,vol.44, 2008. DOI: https://doi.org/10.1029/2006WR005792
Mukunthan, B. & Nagaveni, N.Nagaveni. (2011). "Automating Identification of Unique Patterns, Mutation in Human DNA usingArtificial Intelligence Technique". International Journal of Computer Applications. 25. 26-34. 10.5120/3003-4038. DOI: https://doi.org/10.5120/3003-4038
H. Tajalizadeh and R. Boostani, "A novel stream clustering framework for spam detection in twitter," IEEE Transactions onComputational Social Systems, vol. 6, pp. 525-534, 2019. DOI: https://doi.org/10.1109/TCSS.2019.2910818
B., Mukunthan. (2019). Improved Content Based Medical Image Retrieval using PCA with SURF Features. International Journal ofInnovative Technology and Exploring Engineering. 8. 10.35940/ijitee.J1020.08810S19. DOI: https://doi.org/10.35940/ijitee.J1020.08810S19
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Computer Science, Engineering and Information Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.