Enhancing Software Reliability through Intelligent Fault Prediction Using Machine Learning

Authors

  • Durga Research scholar, Department of Computer Science and Engineering, Kalinga University, Naya Raipur, Chhattisgarh, India Author
  • Dr. Anupa Sinha Assistant Professor, Department of Computer Science and Engineering, Kalinga University, Naya Raipur, Chhattisgarh, India Author

DOI:

https://doi.org/10.32628/CSEIT25113376

Keywords:

Software reliability, fault prediction, machine learning, defect detection, predictive modeling

Abstract

Software fault prediction plays a crucial role in enhancing software quality and reliability by enabling early detection of defect-prone modules. This study proposes a machine learning-based predictive framework to identify software faults using historical software metrics and defect data. The objective is to evaluate and compare the performance of various machine learning algorithms—Random Forest, XGBoost, Support Vector Machine (SVM), and Neural Networks—for their effectiveness in fault detection. Public benchmark datasets comprising object-oriented metrics such as CK, Halstead, and McCabe were used after undergoing pre-processing steps including normalization; class balancing using SMOTE, and feature selection through PCA and RFE. Models were trained and tested using stratified k-fold cross-validation, and evaluated using metrics such as Accuracy, Precision, Recall, F1-score, and ROC-AUC. Among the models, XGBoost outperformed others, achieving the highest F1-score and ROC-AUC, followed by Random Forest. Feature importance analysis and SHAP value interpretation provided insights into key predictors contributing to software faults, enhancing the model's transparency and practical usability. The study demonstrates that ensemble learning methods significantly improve the fault prediction process, making them suitable for deployment in real-world software quality assurance workflows. These findings contribute to the ongoing advancement of predictive software engineering and open new avenues for integrating intelligent fault detection systems in continuous integration and DevOps environments.

Downloads

Download data is not yet available.

References

Ahmed, S., Younis, M. S., & Arshad, F. (2022). A comprehensive empirical analysis of software defect prediction using ensemble learning. Journal of Systems and Software, 192, 111372.

Fenton, N. E., & Neil, M. (2022). Risk assessment and decision analysis with Bayesian networks. CRC Press.

Kaur, R., & Goel, S. (2023). An interpretable hybrid model for software fault prediction using metrics and code smells. Software Quality Journal, 31(1), 123–145.

Lee, H., Kim, J., & Kang, S. (2021). Challenges and recommendations for ML-based fault prediction in industrial software engineering. Empirical Software Engineering, 26(5), 97.

Malhotra, R., & Jain, A. (2022). Software Fault Prediction Models: A Systematic Review and Current Trends. ACM Computing Surveys, 55(6), 1–38.

Sahu, R. K., Sahu, A., & Sharma, S. (2023). A hybrid machine learning approach for improved software defect prediction in imbalanced datasets. Applied Intelligence, 53(3), 2345–2360.

Sharma, A., & Jain, R. (2023). Comparative analysis of rule-based and ML-based fault detection techniques in large-scale software systems. Journal of Software: Evolution and Process, 35(2), e2485.

Sun, Y., Zhang, H., & Guo, Y. (2022). Deep learning-based fault prediction in object-oriented software: A study on complexity metrics. IEEE Transactions on Software Engineering, 48(1), 33–46.

Wang, L., Zhao, H., & Xu, Z. (2023). Boosting-based ensemble strategies for software defect prediction: An empirical evaluation. Information and Software Technology, 150, 106956.

Yadav, S., & Kumar, M. (2023). A hybrid SVM and deep learning model for defect prediction using software metrics. Expert Systems with Applications, 213, 118928.

Zhang, M., Wang, X., & Yu, L. (2021). Software fault prediction using code metrics and LSTM networks. Knowledge-Based Systems, 226, 107142.

Jureczko, M., & Madeyski, L. (2020). Quality of data sets used for empirical validation of machine learning models for software defect prediction. Information and Software Technology, 122, 106275. https://doi.org/10.1016/j.infsof.2020.106275

Kamei, Y., Shihab, E., Adams, B., Hassan, A. E., Mockus, A., Sinha, A., & Ubayashi, N. (2019). A large-scale empirical study of just-in-time quality assurance. IEEE Transactions on Software Engineering, 39(6), 757–773. https://doi.org/10.1109/TSE.2012.70

Singh, R., Sikka, G., & Aggarwal, N. (2021). A review of software fault prediction studies with a focus on ensemble learning techniques. Journal of Systems and Software, 179, 111013. https://doi.org/10.1016/j.jss.2021.111013

Malhotra, R. (2015). A systematic review of machine learning techniques for software fault prediction. Applied Soft Computing, 27, 504–518. https://doi.org/10.1016/j.asoc.2014.11.023

Ambros, M., Lanza, M., & Robbes, R. (2019). An evaluation of change metrics for predicting software defects. IEEE Transactions on Software Engineering, 45(3), 273–292. https://doi.org/10.1109/TSE.2018.2794977

Downloads

Published

11-06-2025

Issue

Section

Research Articles