Application of Data Mining Techniques for Prediction of Diabetes - A Review

Authors

  • Santosh Rani  M.Tech Scholar, Department of computer Science and Engineering , Guru Kashi University, Talwandi, Bathinda, Punjab, India
  • Dr. Sandeep Kautish  Professor, Department of computer Science and Engineering , Guru Kashi University, Talwandi, Bathinda, Punjab, India

Keywords:

Keywords- Prediction, Association, Classification, Data mining, Regression.

Abstract

During past years, the increase in use of internet and scientific information, enormous amount of data produced every minute and this caused growth in lots of repositories and databases. Health care industry also contains a wealth of database. Millions of patients’ database stored in repositories every year. The health care industry ironic in information but knowledge is meager because, with valuable information unsolicited information is also stored and sometimes the valuable information is not analyzed. This unimportant information increases the difficulties of doctors for disease prediction. To solve the problem of this medical mismanagement in health care industry, various machine learning and data mining techniques are used. “Data mining is a process to analyzing the data from large databases. As it is also clear from its name Data Mining searching for valuable information in a large database”. Data mining is also known as knowledge discovery. This paper reviews the application and techniques of data mining to determined how these application and techniques have established, during the past years. The main attention is to explore data mining techniques which are widely used to predict some chronic disease like diabetes and cancer, heart attack and for this various most sited research papers of highest journals are reviewed. The techniques of data mining namely, neural networks, association, classification, regression are analyzed in this review paper.

References

  1. https://en.wikipedia.org/wiki/Data_mi ning
  2. Jerome H. Friedman, "Data mining and Statistic: What's Conection?"
  3. M. Durairaj, V. Ranjani, "Data Mining Applications In Healthcare Sector:" , INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 2, ISSUE 10, OCTOBER 2013 ISSN 2277-8616 29 IJSTR©2013 www.ijstr.org
  4. Sunny Sharma, "A Study on Data Mining Horizons", International Journal of Recent Trends in Engineering & Research (IJRTER) Volume 02, Issue 04; April - 2016 ISSN: 2455-1457]
  5. Charu C. Aggarwal and Philip S. Yu, "Data Mining Techniques for Associations, Clustering and Classification", N. Zhong and L. Zhou (Eds.): PAKDD'99, LNAI 1574, pp. 13-23, 1999. c_Springer-Verlag Berlin Heidelberg .
  6. Andreas Buja , Young-Seop Lee, "Data Mining Criteria for Tree-Based Regression and Classifiaction" ACM , 2001 1-5811-391-x/01/08
  7. Md. Maniruzzaman et.el , "Comparative Approaches for Classification of Diabetes Mellitus Data: Machine Learning Paradigm"S0169-2607 (17)30282-1
  8. Fadi Thabtah (2008) "Mining the data from a hyperheuristic approach using associative classification", Science Direct ,Expert Systems with Applications 34 (2008) 1093-1101
  9. Sajida Perveena, "Performance Analysis of Data Mining Classification Techniques to Predict Diabetes", Elsevier, Procedia Computer Science 82 ( 2016 ) 115 - 121
  10. Roger J. Marshall "The use of classification and regression trees in clinical epidemiology", 2001 Elsevier Science Inc., Journal of Clinical Epidemiology 54 (2001) 603-609,
  11. Bathany A "Evaluating implicit spider fear associations using the Go/No-go Association Task" Elsevier, une 2007, Pages 156-167, volume 38, issue 2
  12. lRedaBoukezzoula et.al "From fuzzy regression to gradual regression: Interval-based analysis and extensions" Elsevier , May 2018, Pages 18-40, volume 441.
  13. G. Trepepi et.al (2008)"Linear and logistic regression analysis", Elsevier, 1 April 2008, Pages 806-810, volume 73.issue 7.
  14. Chang SikKim and SungroLee ,"Spurious regressions driven by excessive volatility" Elsevier December 2011, Pages 292-297, volume 113, issue 3. Support vector regression
  15. Cheng-Fa Tsai et.al "A New Data Clustering Approach for Data Mining in Large Databases" , Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN.02) 1087-4089/02 $17.00 © 2002 IEEE
  16. R. Mythily et.al "Clustering Models for Data Stream Mining", Procedia Computer Science 46 ( 2015 ) 619 - 626, Elsevier B.V
  17. Luca De Angelis and Jose G. Dias b "Mining categorical sequences from data using a hybrid clustering Method" European Journal of Operational Research 234 (2014) 720-730, Science Direct Method" , 0377-2217/$ - see front matter _ 2013 Elsevier B.
  18. Steve Russell and Weldon Lodwick "Fuzzy Clustering in Data Mining for Telco Database Marketing Campaigns" , 0-7803-521 1 -4/99/$10.00 0 1999 IEEE
  19. D.D.Liu et.al "The hourly average solar wind velocity prediction based on support vector regression method" volume: 413, Issue: 4, June 2011)
  20. Basheer M. Al-Maqaleh, "Forrecasting using artificial Neural Network and Statistic Models" I.J. Education and Management Engineering, 2016, 3, 20-32.
  21. Adrian Horzyk and Janusz A.Starzyk (2017) Fast neural nerwork
  22. Adrian Horzyk and Janusz A, "Fast Neural; Network Adaption with Associative Pulsing Neurons", 978-1-5386-2726-6/17/$31.00 ©2017 IEEE. Ahmed R. Abas (2013) Adaptive competitive learning neural networks
  23. Pragati Shukla, Simran Lal, Gauri Kumbhar, Susmita Kulkarni, Mrs. V. V. Waykule, "DISEASE STATUS PREDICTION AND IDENTIFICATION",IJAREST,Volume 5, Issue 1,pp:10-17,2018.
  24. Raid M. Khalil,Adel Al-Jumaily,"Machine Learning Based Prediction of Depression among Type 2 Diabetic Patients",ISKE,vol.18,pp:456-466,2017.
  25. Han Wu, Shengqi Yang, Zhangqin Huang, Jian He, Xiaoyi Wang,"Type 2 diabetes mellitus prediction model based on data mining",Elsevier,vol.4,pp:190-210,2017.
  26. Ahmed R. Abas " Adaptive competitive learning neural networks", Egyptian Informatics Journal (2013) 14, 183-194 l1110-8665 _ 2013 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Information,
  27. Enas M.F. El Houby , "A survey on applying machine learning techniques for management of diseases". Elsevier, (2017)
  28. Takuo Emoto,Tomoya Yamashita,Toshio Kobayashi,Naoto Sasaki,Yushi Hirota,Tomohiro Hayashi, Anna So, Kazuyuki Kasahara,Keiko Yodoi,Takuya Matsumoto,Taiji Mizoguchi,Wataru Ogawa,Ken‑ichi Hirata,"Characterization of gut microbiota profiles in coronary artery disease patients using data mining analysis of terminal restriction fragment length polymorphism: gut microbiota could be a diagnostic marker of coronary artery disease",Springer,vol.3,pp:89-100,2016.
  29. B.M. Patil, Hybrid prediction model for Type-2 diabetic patients. Expert Systems with Applications 37 (2010) 8102-8108.
  30. Aliza Ahmad and Aida MustaphaH, Comparison between Neural Networks against Decision Tree in Improving Prediction Accuracy for Diabetes Mellitus. ICDIPC 2011, Part I, CCIS 188, pp. 537-545, 2011.
  31. Alexis Marcano-Cedeño, Joaquín Torres, and Diego Andina, A Prediction Model to Diabetes Using Artificial Metaplasticity. IWINAC 2011, Part II, LNCS 6687, pp. 418-425, 2011.
  32. Humar, K. and Novruz, A., Design of a hybrid system for the diabetes and heart diseases. Expert Systems with Applications, 35, 82-89, 2008.
  33. Rebecca Schnall and Marlene Rojas, A user-centered model for designing consumer mobile health (mHealth) applications (apps). Journal of Biomedical Informatics 60 (2016) 243-2 33Yogita Gupta, Rana Khudhair Abbas Ahmed2 and Dr. Sandeep Kumar Kautish, "APPLICATION OF DATA MINING AND KNOWLEDGE MANAGEMENT IN SPECIAL REFERENCE TO MEDICAL INFORMATICS: A REVIEW" , vol 2, issue 2 , Int J Med Lab Res 2017, 2(2): 60-76

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
Santosh Rani, Dr. Sandeep Kautish, " Application of Data Mining Techniques for Prediction of Diabetes - A Review , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.1996-2004, March-April-2018.