A Study Using Machine Learning Techniques to Predict Software Quality

Authors

  • B. Sivaji  MCA, Madanapalle Institute of Technology & Science, Madanapalle, India
  • D. Santhosh  MCA, Madanapalle Institute of Technology & Science, Madanapalle, India

Keywords:

Estimation, Machine Learning, Software Quality, Extreme Gradient Decent, Boosting.

Abstract

Software quality estimation is an activity needed at various stages of software development. It may be used for planning the project`s quality assurance practices and for benchmarking. In earlier previous studies, two methods (Multiple Criteria Linear Programming and Multiple Criteria Quadratic Programming) for estimating the quality of software had been used. Also, C5.0, SVM and Neutral network were experimented with for quality estimation. These studies have relatively low accuracies. In this study, we aimed to improve estimation accuracy by using relevant features of a large dataset. We used a feature selection method and correlation matrix for reaching higher accuracies. In addition, we have experimented with recent methods shown to be successful for other prediction tasks. Machine learning algorithms such as XGBoost, Random Forest, Decision Tree, Logistic Regression and Naïve Bayes are applied to the data to predict the software quality and reveal the relation between the quality and development attributes. The experimental results show that the quality level of software can be well estimated by machine learning algorithms.

References

  1. N.Kalaivani, Dr.R.Beena, International Journal of Pure and Applied Mathematics Volume 118 No. 20 2018, 3863-3873 ISSN: 1314-3395.
  2. He, Peng, et al. "An empirical study on software defect prediction with a simplified metric set." Information and Software Technology 59 (2015): 170-190.
  3. Yu, Xiao, et al. "Using Class Imbalance Learning for Cross-Company Defect Prediction." 29th International Conference on Software Engineering and Knowledge Engineering (SEKE 2017). KSI Research Inc. and Knowledge Systems Institute, 2017.
  4. D. Bowes, T. Hall, and J. Petrić, "Software defect prediction: do different classifiers find the same defects?." Software Quality Journal, 26(2), 2018, pp. 525-552.
  5. X. Wang, Y. Zhang, L. Zhang and Y. Shi, "A Knowledge Discovery Case Study of Software Quality Prediction: ISBSG Database," 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, Toronto, ON, 2010, pp. 219-222.
  6. X. Wang, Y. Zhang, L. Zhang and Y. Shi, "A Knowledge Discovery Case Study of Software Quality Prediction Based on Classification Models: ISBSG Database," The 11th International Symposium on Knowledge Systems Sciences (KSS 2010), 2010.
  7. S. Sohrabi, O. Udrea, and A. V. Riabov, “Hypothesis Exploration for Malware Detection Using Planning,” Twenty-Seventh AAAI Conf. Artif. Intell., pp. 883– 889, 2013.
  8. Zimmermann, Thomas, et al. "Cross-project defect prediction: a large scale experiment on data vs. domain vs. process." Proceedings of the the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering. ACM, 2009.
  9. Amasaki, S., Takagi, Y., Mizuno, O., Kikuno, T.: Constructing a bayesian belief network topredict final quality in embedded system development. IEICE Trans. Inf. Syst.8(6), 1134–1141(2005).
  10. Idri, A., Abra, A.: A fuzzy logic based measures for software project similarity: validation andpossible improvements. In: Proceedings of 7th International Symposium on Software Metrics,pp. 85–96. IEEE, England, UK (2001).
  11. Pattnaik, S., Pattanayak, B.K.: A survey on machine learning techniques used for software quality prediction. Int. J. Reasoning-Based Intell. Syst. 12(1/2), 3–14 (2016).
  12. Kapur, P.K., Khatri, S.K., Goswami, D.N.: A generalized dynamic integration software reliability growth model based on neural network approach. In: Proceedings of International Conference on Reliability, Safety and Quality Engineering, pp. 831–838 (2008).
  13. Wagner, S.: A bayesian network approach to assess and predict software quality using activity-based quality model. Inf. Softw. Technol. 52(11), 1230–1241 (2010).
  14. Ahmed, M.A., AL-Jamini, H.A.: Machine learning approaches for predicting software maintainability: a Fuzzy based transparent model. IET Softw. 7(6), 317–326 (2013).
  15. Pattnaik, S., Pattanayak, B.K., Patnail, S.: Prediction of software quality using neuro-fuzzy model. Int. J. Intell. Enterp. (IJIE) 5(3), 292–307 (2018).

Downloads

Published

2022-08-30

Issue

Section

Research Articles

How to Cite

[1]
B. Sivaji, D. Santhosh, " A Study Using Machine Learning Techniques to Predict Software Quality, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 4, pp.316-323, July-August-2022.