Developing the Framework Using Deep Neural Network for Detection of Spam and Fake Spam Messages in Twitter

N. Anil Kumar; Thatha Anusha; Nagamalla Durga Prasad; Maddala Bala Manikanta; Boddu Swathi

doi:10.32628/CSEIT2410289

Authors

N. Anil Kumar Associate Professor, Department of CSE, Sri Vasavi Institute of Engineering & Technology, Nandamuru, Andhra Pradesh, India Author
Thatha Anusha Department of CSE, Sri Vasavi Institute of Engineering & Technology, Nandamuru, Andhra Pradesh, India Author
Nagamalla Durga Prasad Department of CSE, Sri Vasavi Institute of Engineering & Technology, Nandamuru, Andhra Pradesh, India Author
Maddala Bala Manikanta Department of CSE, Sri Vasavi Institute of Engineering & Technology, Nandamuru, Andhra Pradesh, India Author
Boddu Swathi Department of CSE, Sri Vasavi Institute of Engineering & Technology, Nandamuru, Andhra Pradesh, India Author

DOI:

https://doi.org/10.32628/CSEIT2410289

Keywords:

Twitter Spam, Multi-classifier, Classification, Bayesian, K-Nearest neighbor, Random forest

Abstract

Social media plays vital role among the user communities for social gathering, entertainment, communication, sharing knowledge so on. Twitter is one such network to connect millions of users to share information. Nowadays, there are humpteen numbers of users using social media for social engagements. Due to the fact that wide publicity of individuals and products get viral in social media, everyone wish to use social media as a platform to promote their product. Furthermore, large number of people relies on social media contents to take decisions. Twitter is one of the social media platforms to post the media contents by the user. Spammers are illegal users intrude the twitter account and send the duplicate messages to promote advertisement, phishing, scam and personal blogs etc. In this paper, a novel spam detection mechanism is introduced to detect the suspicious users on twitter. The system has been designed such a way that it initially set with semi-supervised at the tweet level and update into supervised level for learning the input tweets to detect the spammers. The proposed system will also identify the type of spammers and will also remove duplicate tweets. We have applied with multi-classifier algorithms like naïve Bayesian, K-Nearest neighbor and Random forest into twitter data set and the performance is compared. The experimental result shows very promising results.

Downloads

Download data is not yet available.

References

Wu, Tingmin, et al., "Twitter spam detection: Survey of new approaches and comparative study”. Computers & Security. 76.10.1016/j.cose.2017.11.013. DOI: https://doi.org/10.1016/j.cose.2017.11.013

M. Jiang, et al., "Suspicious behavior detection: Current trends and future directions," IEEE Intelligent Systems, vol. 31, pp. 31-39,2016. DOI: https://doi.org/10.1109/MIS.2016.5

J. Tanha, et al., "Semi-supervised self-training for decision tree classifiers," International Journal of Machine Learning and Cybernetics,vol. 8, pp. 355-370, 2017. DOI: https://doi.org/10.1007/s13042-015-0328-7

Y. Xia, et al., "A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring," Expert Systems withApplications, vol. 78, pp. 225-241, 2017. DOI: https://doi.org/10.1016/j.eswa.2017.02.017

S. Sedhai and A. Sun, "Semi-supervised spam detection in Twitter stream," IEEE Transactions on Computational Social Systems, vol.5, pp. 169-175, 2017. DOI: https://doi.org/10.1109/TCSS.2017.2773581

S. Liu, et al., "Addressing the class imbalance problem in twitter spam detection using ensemble learning," Computers & Security, vol.69, pp. 35-49, 2017. DOI: https://doi.org/10.1016/j.cose.2016.12.004

C. Chen, et al., "Investigating the deceptive information in Twitter spam," Future Generation Computer Systems, vol. 72, pp. 319-326,2017. DOI: https://doi.org/10.1016/j.future.2016.05.036

G. Lin, et al., "Statistical twitter spam detection demystified: performance, stability and scalability," IEEE Access, vol. 5, pp. 11142-11154, 2017. DOI: https://doi.org/10.1109/ACCESS.2017.2710540

C. Chen, et al., "Statistical features-based real-time detection of drifted twitter spam," IEEE Transactions on Information Forensics andSecurity, vol. 12, pp. 914-925, 2016. DOI: https://doi.org/10.1109/TIFS.2016.2621888

A. Singh and S. Batra, "Ensemble based spam detection in social IoT using probabilistic data structures," Future Generation ComputerSystems, vol. 81, pp. 359-371, 2018. DOI: https://doi.org/10.1016/j.future.2017.09.072

C. Li and S. Liu, "A comparative study of the class imbalance problem in Twitter spam detection," Concurrency and Computation:Practice and Experience, vol. 30, p. e4281, 2018. DOI: https://doi.org/10.1002/cpe.4281

R. Aswani, et al., "Detection of spammers in twitter marketing: a hybrid approach using social media analytics and bio inspiredcomputing," Information Systems Frontiers, vol. 20, pp. 515-530, 2018. DOI: https://doi.org/10.1007/s10796-017-9805-8

A. T. Kabakus and R. Kara, "“TwitterSpamDetector”: A Spam Detection Framework for Twitter," International Journal of Knowledgeand Systems Science (IJKSS), vol. 10, pp. 1-14, 2019. DOI: https://doi.org/10.4018/IJKSS.2019070101

X. Wang, et al., "Drifted Twitter Spam Classification Using Multiscale Detection Test on KL Divergence," IEEE Access, vol. 7, pp.108384-108394, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2932018

B., Mukunthan. (2019). Improved Content Based Medical Image Retrieval using PCA with SURF Features. International Journal ofInnovative Technology and Exploring Engineering. 8. 10.35940/ijitee.J1020.08810S19.

M.Arunkrishna, B.Mukunthan “ Review on Classification of Anti-Spam Solutions : Approaches, Algorithms Demystified.” Studies inIndian Place Names Vol. 40 No. 60 (2020): Vol-40-Issue-60-March-2020 , vol. 40, no. 60, 6 Mar. 2020, pp. 4449–4458.

Mukunthan B, Nagaveni N. Identification of unique repeated patterns, location of mutation in DNA finger printing using artificialintelligence technique. Int J Bioinform Res Appl. 2014;10(2):157‐176. doi:10.1504/IJBRA.2014.059516 DOI: https://doi.org/10.1504/IJBRA.2014.059516

A, Pushpalatha & B, Mukunthan. (2010). Automation of DNA Finger Printing for Precise Pattern Identification using Neural-fuzzyMapping approach. International Journal of Computer Applications. 12. 10.5120/1761-2411. DOI: https://doi.org/10.5120/1761-2411

V. Vishwarupe, et al., "Intelligent Twitter spam detection: a hybrid approach," in Smart Trends in Systems, Security and Sustainability,ed: Springer, 2018, pp. 189-197. DOI: https://doi.org/10.1007/978-981-10-6916-1_17

C.C. Wei and N.S. Hsu,"Derived operating rules for a reservoir operation system: Comparison of decision trees, neural decision treesand fuzzy decision trees,"Water resources research,vol.44, 2008. DOI: https://doi.org/10.1029/2006WR005792

Mukunthan, B. & Nagaveni, N.Nagaveni. (2011). "Automating Identification of Unique Patterns, Mutation in Human DNA usingArtificial Intelligence Technique". International Journal of Computer Applications. 25. 26-34. 10.5120/3003-4038. DOI: https://doi.org/10.5120/3003-4038

H. Tajalizadeh and R. Boostani, "A novel stream clustering framework for spam detection in twitter," IEEE Transactions onComputational Social Systems, vol. 6, pp. 525-534, 2019. DOI: https://doi.org/10.1109/TCSS.2019.2910818

B., Mukunthan. (2019). Improved Content Based Medical Image Retrieval using PCA with SURF Features. International Journal ofInnovative Technology and Exploring Engineering. 8. 10.35940/ijitee.J1020.08810S19. DOI: https://doi.org/10.35940/ijitee.J1020.08810S19

Developing the Framework Using Deep Neural Network for Detection of Spam and Fake Spam Messages in Twitter

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

IssueDate

RightSideBlock

Latest publications