A Comprehensive Review on Email Spam Classification with Machine Learning Methods

Authors

  • Prachi Bhatnagar  Research Scholar, Dept. of Computer Engineering, Sigma Institute of Engineering, Gujarat, India
  • Sheshang Degadwala  Associate Professor & Head of Department, Dept. of Computer Engineering, Sigma University, Gujarat, India

DOI:

https://doi.org/10.32628/CSEIT2361048

Keywords:

Email, Spam, Classification, Machine Learning, Algorithms, Review, Detection.

Abstract

This comprehensive review delves into the realm of email spam classification, scrutinizing the efficacy of various machine learning methods employed in the ongoing battle against unwanted email communication. The paper synthesizes a wide array of research findings, methodologies, and performance metrics to provide a holistic perspective on the evolving landscape of spam detection. Emphasizing the pivotal role of machine learning in addressing the dynamic nature of spam, the review explores the strengths and limitations of popular algorithms such as Naive Bayes, Support Vector Machines, and neural networks. Additionally, it examines feature engineering, dataset characteristics, and evolving threats, offering insights into the challenges and opportunities within the field. With a focus on recent advancements and emerging trends, this review aims to guide researchers, practitioners, and developers in the ongoing pursuit of robust and adaptive email spam classification systems.

References

  1. K. Taghandiki, “Building an Effective Email Spam Classification Model with spaCy,” pp. 1–5, 2023, [Online]. Available: http://arxiv.org/abs/2303.08792
  2. R. Fatima et al., “An Optimized Approach For Detection and Classification of Spam Email’s Using Ensemble Methods,” 2023.
  3. L. Jeeva and I. S. Khan, “Enhancing Email Spam Filter ’ s Accuracy Using Machine Learning,” vol. 5, no. 4, pp. 1–12, 2023.
  4. M. A. Bouke, A. Abdullah, and M. T. Abdullah, “A Lightweight Machine Learning-Based Email Spam Detection Model Using Word Frequency Pattern,” vol. 4, no. 1, pp. 15–28, 2023, doi: 10.48185/jitc.v4i1.653.
  5. H. Takci and F. Nusrat, “Highly Accurate Spam Detection with the Help of Feature Selection and Data Transformation,” International Arab Journal of Information Technology, vol. 20, no. 1, pp. 29–37, 2023, doi: 10.34028/iajit/20/1/4.
  6. K. Iqbal and M. S. Khan, “Email classification analysis using machine learning techniques,” Applied Computing and Informatics, 2022, doi: 10.1108/ACI-01-2022-0012.
  7. H. Lee, S. Jeong, S. Cho, and E. Choi, “Visualization Technology and Deep-Learning for Multilingual Spam Message Detection,” Electronics (Switzerland), vol. 12, no. 3, 2023, doi: 10.3390/electronics12030582.
  8. T. S. Dhivya, S. G. Priya, Bt. Student, and T. Fellow, “Email Spam Detection and Data Optimization using NLP Techniques,” International Journal of Engineering Research & Technology, vol. 10, no. 08, pp. 38–49, 2021, [Online]. Available: www.ijert.org
  9. A. Masri and M. Al-Jabi, “A novel approach for Arabic business email classification based on deep learning machines,” PeerJ Computer Science, vol. 9, no. 2017, p. e1221, 2023, doi: 10.7717/peerj-cs.1221.
  10. A. Junnarkar, S. Adhikari, J. Fagania, P. Chimurkar, and D. Karia, “E-mail spam classification via machine learning and natural language processing,” Proceedings of the 3rd International Conference on Intelligent Communication Technologies and Virtual Mobile Networks, ICICV 2021, no. Icicv, pp. 693–699, 2021, doi: 10.1109/ICICV50876.2021.9388530.
  11. M. Crawford, T. M. Khoshgoftaar, J. D. Prusa, A. N. Richter, and H. Al Najada, “Survey of review spam detection using machine learning techniques,” Journal of Big Data, vol. 2, no. 1, 2015, doi: 10.1186/s40537-015-0029-9.
  12. S. Cheng, “Classification of Spam E-mail based on Naïve Bayes Classification Model,” Highlights in Science, Engineering and Technology, vol. 39, pp. 749–753, 2023, doi: 10.54097/hset.v39i.6640.
  13. N. Ahmed, R. Amin, H. Aldabbas, D. Koundal, B. Alouffi, and T. Shah, “Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges,” Security and Communication Networks, vol. 2022, 2022, doi: 10.1155/2022/1862888.
  14. I. AbdulNabi and Q. Yaseen, “Spam email detection using deep learning techniques,” Procedia Computer Science, vol. 184, no. 2019, pp. 853–858, 2021, doi: 10.1016/j.procs.2021.03.107.
  15.  E. G. Dada, J. S. Bassi, H. Chiroma, S. M. Abdulhamid, A. O. Adetunmbi, and O. E. Ajibuwa, “Machine learning for email spam filtering: review, approaches and open research problems,” Heliyon, vol. 5, no. 6, 2019, doi: 10.1016/j.heliyon.2019.e01802.

Downloads

Published

2023-10-30

Issue

Section

Research Articles

How to Cite

[1]
Prachi Bhatnagar, Sheshang Degadwala, " A Comprehensive Review on Email Spam Classification with Machine Learning Methods" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 9, Issue 10, pp.283-288, September-October-2023. Available at doi : https://doi.org/10.32628/CSEIT2361048