Phishing Website Detection Based on URL

Authors

  • Salvi Siddhi Ravindra  Computer Engineering Department, Shah & Anchor Kutchhi Engineering College Mumbai, Maharashtra, India
  • Shah Juhi Sanjay  Computer Engineering Department, Shah & Anchor Kutchhi Engineering College Mumbai, Maharashtra, India
  • Shaikh Nausheenbanu Ahmed Gulzar  Computer Engineering Department, Shah & Anchor Kutchhi Engineering College Mumbai, Maharashtra, India
  • Khodke Pallavi  Computer Engineering Department, Shah & Anchor Kutchhi Engineering College Mumbai, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT2173124

Keywords:

URLs, Phishing, Legitimate, Machine Learning.

Abstract

In today's era, due to the surge in the usage of the internet and other online platforms, security has been major attention. Many cyberattacks take place each day out of which website phishing is the most common issue. It is an act of imitating a legitimate website and thereby tricking the users and stealing their sensitive information. So, concerning this problem, this paper will introduce a possible solution to avoid such attacks by checking whether the provided URLs are phishing URLs or legitimate URLs. It is a Machine Learning based system especially Supervised learning where we have provided 2000 phishing and 2000 legitimate URL dataset. We have taken into consideration the Random Forest Algorithm due to its performance and accuracy. It considers 9 features and hence detects whether the URL is safe to access or a phishing URL.

References

  1. Khonji, Mahmoud, Youssef Iraqi, and Andrew Jones. "Phishing detection: a literature survey." Communications Surveys & Tutorials, IEEE 15.4 (2013): 2091-2121.
  2. Anti Phishing Working Group. (2015. March.) APWG Phishing Activity Trend Report 2nd Quarter 2010. [Online]. Available: http://docs.apwg.org/reports/apwg_trends_report_q2_2 014.pdf
  3. Anti Phishing Working Group. (2015. March.) APWG Phishing Activity Trend Report 2nd Quarter 2014. [Online]. Available: http://docs.apwg.org/reports/apwg_report_q2_2010.pdf
  4. Huang, Huajun, Junshan Tan, and Lingxi Liu. "Countermeasure techniques for deceptive phishing attack." New Trends in Information and Service Science, 2009. NISS'09. International Conference on. IEEE, 2009.
  5. Ma, Justin, et al. "Beyond blacklists: learning to detect malicious web sites from suspicious URLs." Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2009.
  6. Nguyen, Luong Anh Tuan, et al. "A novel approach for phishing detection using URL-based heuristic." Computing, Management and Telecommunications (ComManTel), 2014 International Conference on. IEEE, 2014.
  7. Wikipedia. (2015. March) Uniform Resource Loactor. Avaliable: http://en.wikipedia.org/wiki/Uniform_resource_locator
  8. Kausar, Firdous, et al. "Hybrid Client Side Phishing Websites Detection Approach." International Journal of Advanced Computer Science and Applications (IJACSA) 5.7 (2014).
  9. Sunil, A. Naga Venkata, and Anjali Sardana. "A pagerank based detection technique for phishing web sites." Computers & Informatics (ISCI), 2012 IEEE Symposium on. IEEE, 2012.
  10. Mohammad, Rami M., Fadi Thabtah, and Lee McCluskey. "Intelligent rule-based phishing websites classification." Information Security, IET 8.3 (2014): 153-160.
  11. Singh, C., & `Meenu., "Phishing Website Detection Based on Machine Learning: A Survey", IEEE 6th International Conference on Advanced Computing & Communication Systems, Gorakhpur, India, 2020, 978- 1-7281-5197-7.
  12. Aydin, M., Butun, I., Bicakci, K., & Baykal, N., "Using Attribute-based Feature Selection Approaches and Machine Learning Algorithms for Detecting Fraudulent Website URLs",IEEE 10th Annual Computing and Communication Workshop and Conference, Ankara, Turkey,Goteborg, Sweden, Guzelyurt, Cyprus, 30-May- 2020, 978-1-7281-3783-4.
  13. A, A. A., & K, P. “Towards the Detection of Phishing Attacks”, IEEE 4th International Conference on Trends in Electronics and Informatics, Coimbatore,India, July 27-2020, 978-1-7281-5518-0
  14. Arun Kulkarni & Leonard L. Brown, ” Phishing Websites Detection using Machine Learning ”, International Journal of Advanced Computer Science and Applications , Tyler, TX, 2019.
  15. El Aassal, A., Baki, S., Das, A., & Verma, R. M.,"An In-Depth Benchmarking and Evaluationof Phishing Detection Research for Security Needs", IEEE Access,Houston, U.S., 5-Feb-2020, 2969780.
  16. Korkmaz, M., Sahingoz, O. K., & Diri, B., "Detection of Phishing Websites by Using Machine Learning- Based URL Analysis",IEEE IIT - Kharagpur, Istanbul, Turkey, 1-Jul-2020, 49239.
  17. Mohammed Hazim Alkawaz, Stephanie Joanne Steven and Asif Iqbal Hajamydeen,"Detecting Phishing Website Using Machine Learning",IEEE International Colloquium on Signal Processing & its Applications, Selangor, Malaysia, 28-Feb-2020, 978-1-7281-5310-0.
  18. Mohammed Zakariah, “Classification of large datasets using Random Forest Algorithm in various applications: Survey”, International Journal of Engineering and Innovative Technology, Riyadh, Kingdom of Saudi Arabia, September 2014, 2277-3754.
  19. Mehmet Korkmaz, Ozgur Koray Sahingoz & Banu Diri, “Detection of Phishing Websites by Using Machine Learning-Based URL Analysis”, from IEEE Xplore, Istanbul/Turkey, July 2020, 49239.
  20. Bhagyashree E. Sananse & Tanuja K. Sarode, "Phishing URL Detection: A Machine Learning and Web Mining- based Approach", International Journal of Computer Applications, Mumbai, August 2015, 0975 – 8887.
  21. Rishikesh Mahajan & Irfan Siddavatam, "Phishing Website Detection using Machine Learning Algorithms", International Journal of Computer Applications, Mumbai, October 2018,328541785.
  22. Jin-Lee Lee, Dong-Hyun Kim & Chang-Hoon, Lee, "Heuristic-based Approach for Phishing Site Detection Using URL Features", Conf. on Advances in Computing, Electronics and Electrical Technology, USA, 2015, 978-1-63248-056-9-84.

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Salvi Siddhi Ravindra, Shah Juhi Sanjay, Shaikh Nausheenbanu Ahmed Gulzar, Khodke Pallavi, " Phishing Website Detection Based on URL, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 3, pp.589-594, May-June-2021. Available at doi : https://doi.org/10.32628/CSEIT2173124