Upgrading the Performance of KNN and Naïve Bayes in Diabetes Detection with Genetic Algorithm for Feature Selection

Authors

  • Ratna Nitin Patil  Department of Computer Engineering, Research Scholar, Babasaheb Ambedkar Marathwada University, Aurangabad, India
  • Dr. Sharvari Chandrashekhar Tamane  Department of Computer Engineering JNEC, Aurangabad, India.

Keywords:

Data Mining; Diabetes Mellitus; feature selection; Classification; Genetic Algorithm search Paper, Technical Writing, Science and Technology

Abstract

Data mining is a science that is used for finding models and association rules in huge data where other statistical analysis cannot do that. The medical science needs data mining for analyzing massive data and producing predictive models. The purpose of this research is to present a framework for creating, evaluation and exploitation of data mining models. In this study we have used the combination of artificial intelligent technique such as feature selection with k Nearest Neighbor (kNN) and Naïve Bayes approach to develop a predictive model which classifies the patient as healthy and diabetic [1][2]. The main purpose of feature subset selection is to reduce the number of features used in classification while maintaining the acceptable classification accuracy. Our proposed Genetic algorithm (GA) for feature selection approach improves the classification accuracy and uses fewer input features [3]. Some attributes in the dataset may not be useful for diagnosis and thus can be eliminated before learning. The goal of feature selecting is to find a least set of attributes so that the resulting probability distribution of the data classes is close to the original distribution obtained by all attributes [4]. Irrelevant, redundant, or noisy data can be removed by using Genetic Algorithm which in turn improves the mining performance such as predictive accuracy and result comprehensibility

References

  1. Nirmala Devi M.,Appavu S. , Swathi U.V., ―An amalgam KNN to predict diabetes mellitus:, IEEE, 2013.
  2. Rashedur M. Rahman, Farhana Afroz “ Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis”, Journal of Software Engineering and Applications, 2013, 6, 85-97.
  3. C. Gunavathi, K. Premalatha “Performance Analysis of Genetic Algorithm with kNN and SVM for Feature Selection in Tumor Classification”, World Academy of Science, Engineering and Technology International Journal of Computer and Information Engineering Vol:8, No:8, 2014.
  4. Devi kalyan Karumanchi, James Dillon and Elizabeth Gaillard “ Early diagnosis of Diabetes mellitus through the eye “2nd International Conference on Endocrinology October 20-22, 2014.
  5. Selvakuberan, K., Kayathiri, D., Harini, B. and Devi, M.I “An Efficient Feature Selection Method for Classification in Health Care Systems Using Machine Learning Techniques”, IEEE. pp.8610–8615, 2011.
  6. Amir Amiri and Vahid Rafe “Hybrid Algorithm for Detecting Diabetes “, International Research Journal of Applied and Basic Sciences © 2014 Available online at www.irjabs.com ISSN 2251-838X / Vol, 8 (12): 2347-2353 Science Explorer Publications.
  7. M Nahla H. Barakat, Andrew P. Bradley, Senior Member, IEEE, and Mohamed Nabil H.Barakat “Intelligible Support Vector Machines For Diagnosis Of Diabetes Mellitus” IEEE Transactions On Information Technology In Biomedicine, Vol. 14, No. 4, July 2010 Digital Object Identifier10.1109/TITB.2009.2039485 July 2010.
  8. R.R.Janghel, Anupam Shukla, Ritu Tiwari, Rahul Kala “Breast Cancer Diagnostic System using Symbiotic Adaptive Neuro-evolution (SANE)” Proceedings of the 2010 IEEE International Conference of Soft Computing and Pattern Recognition, Cercy Pontoise/Paris, France, pp 326-329.
  9. Parashar, Rawat “Diagnosis of PIMA Indian Diabetes by LDA-SVM Approach”, International Journal of Engineering Research & Technology, Vol 3, issue10, Oct2014, ISSN2278-0181.
  10. Norul Hidayah Ibrahim et.al “ A Hybrid Model of Hierarchical Clustering and Decision Tree for Rule-based Classification of Diabetic Patients” nternational Journal of Engineering and Technology, Vol 5 No 5 Oct-Nov 2013.
  11. Jayalakshmi, T. and Santhakumaran, “A novel classification method for diagnosis of diabetes mellitus using artificial neural networks”, International Conference on Data Storage and Data Engineering, DSDE 2010, Bangalore, India, pp.159–163.
  12. Patil, B.M., Joshi, R.C. and Toshniwal “Association rule for classification of type-2 diabetic patients”, Second International Conference on Machine Learning and Computing 2010, IEEE.
  13. M. Kothainayaki, P. Thangaraj “Clustering and Classifying Diabetic Data Sets Using K-Means Algorithm” Article can be accessed online at http://www.publishingindia.
  14. Md. Mozaharul Mottalib, Md. Mokhlesur Rahman, Md. Tarek Habib and Farruk Ahmed “Detection of the Onset of Diabetes Mellitus by Bayesian Classifier Based Medical Expert System” Transaction on Machine Learning and Artificial Intelligence DOI: 10.14738/tmlai.44.1962 Publication Date: 19th July, 2016.
  15. K. Saravananathan and T. Velmurugan, “Analyzing Diabetic Data using Classification Algorithms in Data Mining,” In proceeding of Indian Journal of Science and Technology, vol 9, no. 43, Nov 2016.
  16. Mostafa EL HABIB DAHO, Nesma SETTOUTI, Mohammed El Amine LAZOUNI, M. Amine CHIKH “Recognition of diabetes disease using a new hybrid learning algorithm for NEFCLASS” Conference: Conference: IEEE Proceeding of the 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA), 2013.
  17. Seera, Lim “A hybrid intelligent system for medical data classification” Volume 41, Issue 5, April 2014, Pages 2239-2249.
  18. Pasi Luukka “Feature selection using fuzzy entropy measures with similarity classifier” Expert Systems with Applications Elsevier Volume 38, Issue 4, April 2011, Pages 4600-4607.
  19. Savvas Karatsiolis, Christos N. Schizas “Region-based support vector machine algorithm for medical diagnosis on Pima Indian diabetes dataset” pp:139-144DOIBookmark:.http://doi.ieeecomputersociety.org/10.1109/ BIBE.2012.6399663
  20. HasanTemurtas, NejatYumusak,FeyzullahTemurtas “A comparative study on diabetes disease diagnosis using neural networks” Expert Systems with Applications Elsevier Volume 36, Issue 4, May 2009, Pages 8610-8615.
  21. Margret Anouncia S., Clara Madonna L. J., Jeevitha P., Nandhini R. T. “Design of a Diabetic Diagnosis System Using Rough Sets” CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, No 3 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0030.
  22. EsinDogantekin,AkifDogantekin, DeryaAvci,LeventAvcid “An intelligent diagnosis system for diabetes on Linear Discriminant Analysis and Adaptive Network Based Fuzzy Inference System: LDA-ANFIS” Digital Signal Processing Elsevier Volume 20, Issue 4, July 2010, Pages 1248-1255.
  23. Muhammad Waqar Aslam et al. “Feature generation using genetic programming with comparative partner selection for diabetes classification” Expert Systems with Applications 40(13):5402-5412 · October 2013.
  24. Mostafa Fathi, Mohammad Saniee Abadeh “Using fuzzy ant colony optimization for diagnosis of diabetes disease” DOI: 10.1109/IRANIANCEE.2010.5507019 · Source: IEEE Xplore Conference: Conference: Electrical Engineering (ICEE), 2010 18th Iranian Conference.

Downloads

Published

2018-02-28

Issue

Section

Research Articles

How to Cite

[1]
Ratna Nitin Patil, Dr. Sharvari Chandrashekhar Tamane, " Upgrading the Performance of KNN and Naïve Bayes in Diabetes Detection with Genetic Algorithm for Feature Selection, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.1371-1381, January-February-2018.