Malware Detection Using Machine Learning

Authors

  • Shubham Gade  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India
  • Kaustubh Gade  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India
  • Pratik Bhujange  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India
  • Sonu Khapekar  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India
  • Vilas Deotare  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India
  • Chandrakant D. Kokane  Nutan Maharashtra Institute of Engineering and Technology, Pune, Maharashtra, India

Keywords:

Malware Detection, Cybersecurity, Machine Learning, Logistic Regression, Decision Tree, Random Forest, Feature Selection, Cyber Threats

Abstract

Malware detection is a critical cybersecurity task, and this research explores the application of machine learning techniques to enhance detection accuracy. Leveraging Logistic Regression, Decision Tree, and Random Forest Classifier algorithms, our approach effectively classifies files as benign or malicious based on extracted features. Feature selection is performed to identify the most informative attributes. The models are evaluated on performance metrics, including accuracy and ROC curves, demonstrating their effectiveness. By utilizing ensemble methods and interpretability of Decision Trees, we aim to provide robust, explainable, and high-accuracy malware detection solutions. In a comparative analysis, we assess the strengths and weaknesses of each algorithm, enabling practitioners to make informed choices. Furthermore, we address the challenge of handling imbalanced datasets, which is common in real-world scenarios, ensuring that our approach maintains a high detection rate for both benign and malicious samples.

References

  1. Akhtar, M.S.; Feng, T. IOTA based anomaly detection machine learning in mobile sensing. EAI Endorsed Trans. Create. Tech. 2022, 9, 172814.
  2. W. Han, J. Xue, Y. Wang, L. Huang, Z. Kong, L. Mao Maldae: Detecting and explaining malware based on correlation and fusion of static and dynamic characteristics Comput Secur, 83 (2019), pp. 208-233.
  3. J. Singh, J. Singh, A survey on machine learning-based malware detection in executable files, J Syst Architect, 112 (2021), Article 101861.
  4. J. Acharya, A. Chuadhary, A. Chhabria, S. Jangale, “Detecting malware, malicious urls and virus using machine learning and signature matching”, 2021 2nd International Conference for Emerging Technology (INCET) (2021), pp. 1-5.
  5. D. Gibert, C. Mateu, J. Planes, “The rise of machine learning for detection and classification of malware: research developments, trends and challenges”, J Network Comput Appl, 153 (2020), Article 102526.
  6. A. Kumar, K. Abhishek, K. Shah, D. Patel, Y. Jain, H. Chheda, P. Nerurkar, “Malware detection using machine learning”, B. Villazón-Terrazas, F. Ortiz-Rodríguez, S.M. Tiwari, S.K. Shandilya (Eds.), Knowledge Graphs and Semantic Web, Springer International Publishing, Cham (2020), pp. 61-71.
  7. Choudhary S, Sharma A. Malware detection amp; classification using machine learning. In 2020 International Conference on Emerging Trends in Communication, Control and Computing (ICONC3); 2020. pp. 1–4. doi:10.1109/ICONC345789.2020.9117547.
  8. Kokane, Chandrakant D., and Sachin D. Babar. "Supervised word sense disambiguation with recurrent neural network model." Int. J. Eng. Adv. Technol.(IJEAT) 9.2 (2019).
  9. Kokane, Chandrakant D., Sachin D. Babar, and Parikshit N. Mahalle. "Word Sense Disambiguation for Large Documents Using Neural Network Model." 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT). IEEE, 2021.
  10. Kokane, Chandrakant D., Sachin D. Babar, and Parikshit N. Mahalle. "An adaptive algorithm for lexical ambiguity in word sense disambiguation." Proceeding of First Doctoral Symposium on Natural Computing Research: DSNCR 2020. Springer Singapore, 2021.
  11. Kokane, Chandrakant, et al. "Word Sense Disambiguation: A Supervised Semantic Similarity based Complex Network Approach." International Journal of Intelligent Systems and Applications in Engineering 10.1s (2022): 90-94.
  12. Kokane, Chandrakant D., et al. "Machine Learning Approach for Intelligent Transport System in IOV-Based Vehicular Network Traffic for Smart Cities." International Journal of Intelligent Systems and Applications in Engineering 11.11s (2023): 06-16.
  13. Kokane, Chandrakant D., et al. "Word Sense Disambiguation: Adaptive Word Embedding with Adaptive-Lexical Resource." International Conference on Data Analytics and Insights. Singapore: Springer Nature Singapore, 2023

Downloads

Published

2023-10-30

Issue

Section

Research Articles

How to Cite

[1]
Shubham Gade, Kaustubh Gade, Pratik Bhujange, Sonu Khapekar, Vilas Deotare, Chandrakant D. Kokane, " Malware Detection Using Machine Learning" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 9, Issue 10, pp.258-263, September-October-2023.