A New Method for Missing Value Imputation in Incomplete Information Systems Using Hybrid Rough Set Theory

Authors

  • G. V. Suresh  Research Scholar, Jawaharlal Nehru Technological University, Hyderabad, Telangana, India
  • Dr. E. Sreenivasa Reddy  Professor, University College of Engineering, Acharya Nagarjuna University, Andhra Pradesh, India

Keywords:

Decision Making, Incomplete Information Systems, Rough Set Theory (RST), Missing Value Imputation.

Abstract

Decision making has become a main reason for data analysis in the current scenario. Before analysis, the data must be freed from noise by applying data pre-processing techniques to the raw data. Missing value imputation is one of the data cleaning methods in data preprocessing. This article presents a new data imputation technique with the concepts of approximate set theory. A Hybrid Rough Set Theory for Missing Value Imputation (HRST-MVI) imputation algorithm is developed. The performance of the proposed algorithm is carried out by comparing the classification accuracy obtained, after the imputation of the missing value. The proposed method and comparative methods were compared using different classifiers in terms of accuracy, precision, recall, and F1 score. The performance of the classifiers shows that the HRST-MVI can impute missing values from multiple patterns more efficiently than other comparative methods.

References

  1. El-Hasnony IM, El-Bakry HM, Saleh AA (2016) Classification of breast cancer using soft computing techniques. Int J Electron Inf Eng 4(1):45–54.
  2. Wang H, Wang S (2010) Mining incomplete survey data through classification. Knowledge. Inf Syst 24(2):221–233.
  3. J.W. Grzymala-Busse, L.K. Goodwin, W.J. Grzymala-Busse, X. Zheng, Handling missing attribute values in preterm birth data sets, Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, (Springer, 2005), pp.342-351.
  4. T. Schneider, Analysis of incomplete climate data: Estimation of mean values and covariance matrices and imputation of missing values, Journal of Climate, 14 (2001) 853-871.
  5. R. Jensen, C. Cornelis, Q. Shen, Hybrid fuzzy-rough rule induction and feature selection, (IEEE2009), pp.1151-1156.
  6. Pawlak, Z., 1991. Rough Set-Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht.
  7. Komorowski, J., Pawlak, Z., Polkowski, L., Skowron, A., 1999. A Rough Set Perspective on Data and Knowledge. The Handbook of Data Mining and Knowledge Discovery, Oxford University Press.
  8. Kim, J., Curry, J., 1997. The treatment of missing data in multivariate analysis. Socio-logical Methods and Research 6, 215–241.
  9. Kim, J., Curry, J., 1997. The treatment of missing data in multivariate analysis. Socio-logical Methods and Research 6, 215–241.
  10. Ahmed;T.,et.al.,“Data Missing Solution Using Rough Set Theory and Swarm Intelligence”, JIJACSIT, Vol.2, No. 3, Page: 1-16, ISSN :2296-1739,2013.
  11. Yu;J.,et.al.,”Andrzej Skowron Rough Set and Knowledge Technology”, 5th International Conference, pp.135 ,2010 .
  12. Qian,Y., et al.: Multi-granulation decision-theoretic rough sets. Int. J.Approx.Reason.55(1), 225–237(2014)
  13. Zhang,R., et al.: Rough set  attribute reduction algorithm based on tabu discrete particle swarm optimization. Chin.Comput.Syst.38(008),1840–1844(2017).
  14. He, L., et al.: Hybrid multi-granulation rough sets of variable precision based on tolerance .J. Intell.FuzzySyst.31(2), 717–725(2015)
  15. Pawlak,Z.: Roughset. Int. J. Comput.Inf. Sci. 11(5), 341–356(1982)
  16. Pawlak,Z., Skowron,A.: Rudiments of rough sets. Inf. Sci. 177(1), 3–27(2007)
  17. Jaddi,N., Abdullah,S.: Hybrid of genetic algorithm and great deluge algorithm for rough set attribute reduction. Turk. J. Electr. Eng. Comput.Sci. 21(6), 1737–1750(2013)
  18. Sang,Y., Qian,Y.: Granular structure reduction approach to multi-granulation  decision-theoretic rough sets.  Comput.Sci.44(005),199–205(2017).
  19. Chebrolu,S., Sanjeevi,S.: Attribute reduction in decision-theoretic rough set model using particle swarm optimization with the threshold parameters determined using LMS training rule. Procedia Comput. Sci. 57,527–536(2015)
  20. Chen, Y, Miao, D, Wang, R and Wu, K. ”A Rough Set Approach to Feature Selection Based On Power Set Tree”. Knowledge-Based System, Vol. 24, PP. 275–281, 2011.
  21. Guo, Q.L and Zhang, M. ”Implement web learning environment based on data mining”, Knowledge-Based Systems, Vol. 22, No. 6, PP. 439-442, 2009.
  22. Wroblewski, J. ”Finding minimal reducts using genetic algorithms”. In Proceedings of the second annual join conference on information science, PP. 186-189, 1995.

Downloads

Published

2020-08-31

Issue

Section

Research Articles

How to Cite

[1]
G. V. Suresh, Dr. E. Sreenivasa Reddy, " A New Method for Missing Value Imputation in Incomplete Information Systems Using Hybrid Rough Set Theory" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 6, Issue 4, pp.580-592, July-August-2020.