Risk Prediction for Coronary Heart Disease Using C5.0 Decision Tree Algorithm

Authors

  • M. Nivedhika  Computer Science Department, Pondicherry University, Pondicherry, India

Keywords:

Decision Tree(DT), Data Mining(DM) and C5.0 algorithm

Abstract

Heart Disease is now-a-days one of the most leading causes of death rate in the worlds. Here we aim to explain the CHD predictive model using C5.0 one of the DT algorithm. In a every single day there are huge amount of data were providing in the hospitals. For making a correct decision for the disease some of the hidden information are not mined properly or effectively. So, here we are using some DM techniques which can make a solution to this situation. This work has been developed using the DM techniques namely DT (CART, C5.0) algorithms and classification (Naive Bayes, Neural Network) algorithms. Result shows each technique has its own specific strength in realizing the objective of the defined mining goals. Using attributes (age,sex,cp, trestbps, chol, fbs, restecg, thalach, exang, oldpeak, slope, thal, ca, num) it can predict the likelihood of patients getting heart disease. In this paper we studied and validated the predictive power of DM algorithms by comparing the performance of C5.0 with two classifier algorithm Naive Bayes, Neural Network and one DT algorithms CART. At finally we are comparing it Accuracy, Sensitivity, Specificity with two different types of classification algorithm and one DT algorithm. Contrary to the former study, the C5.0 algorithm performed best than the other two classifiers algorithm and one DT algorithm in predicting CHD, and C5.0 have the highest predictive power. This paper provides an insight about C5.0 DT algorithm used to predict the heart diseases.

References

  1. Ding K, Bailey KR, Kullo IJ. Genotype-informed estimation of risk of coronary heart disease based on genome-wide association data linked to the electronic medical record. BMC cardiovascular disorders. 2011 Dec;11(1):66.
  2. Janssen DJ, Wouters EF, Schols JM, Spruit MA. Self-perceived symptoms and care needs of patients with severe to very severe chronic obstructive pulmonary disease, congestive heart failure or chronic renal failure and its consequences for their closest relatives: the research protocol. BMC Palliative Care. 2008 Dec;7(1):5.
  3. Lu SE, Beckles GL, Crosson JC, Bilik D, Karter AJ, Gerzoff RB, Lin Y, Ross SV, McEwen LN, Waitzfelder BE, Marrero D. Evaluation of risk equations for prediction of short-term coronary heart disease events in patients with long-standing type 2 diabetes: the Translating Research into Action for Diabetes (TRIAD) study. BMC endocrine disorders. 2012 Dec;12(1):12.
  4. Eom JH, Kim SC, Zhang BT. AptaCDSS-E: A classifier ensemble-based clinical decision support system for cardiovascular disease level prediction. Expert Systems with Applications. 2008 May 1;34(4):2465-79.
  5. Paredes S, Rocha T, De Carvalho P, Henriques J, Harris M, Morais J. Long term cardiovascular risk models’ combination. Computer methods and programs in biomedicine. 2011 Mar 1;101(3):231-42.
  6. Shilaskar S, Ghatol A. Feature selection for medical diagnosis: Evaluation for cardiovascular diseases. Expert Systems with Applications. 2013 Aug 1;40(10):4146-53.
  7. Nambiar R, Bhardwaj R, Sethi A, Vargheese R. A look at challenges and opportunities of big data analytics in healthcare. InBig Data, 2013 IEEE International Conference on 2013 Oct 6 (pp. 17-22). IEEE.
  8. Gandhi M, Singh SN. Predictions in heart disease using techniques of data mining. InFuturistic Trends on Computational Analysis and Knowledge Management (ABLAZE), 2015 International Conference on 2015 Feb 25 (pp. 520-525). IEEE.
  9. Theresa Princy. R and J. Thomas, "Human Heart Disease Prediction System using DM Techniques", 2016 International Conference on Circuit, Power and Computing Technologies.
  10. Sudeshna P, Bhanumathi S, Hamlin MA. Identifying symptoms and treatment for heart disease from biomedical literature using text data mining. InComputation of Power, Energy Information and Commuincation (ICCPEIC), 2017 International Conference on 2017 Mar 22 (pp. 170-174). IEEE.
  11. Pang SL, Gong JZ. C5. 0 classification algorithm and application on individual credit evaluation of banks. Systems Engineering-Theory & Practice. 2009 Dec 1;29(12):94-104.

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
M. Nivedhika, " Risk Prediction for Coronary Heart Disease Using C5.0 Decision Tree Algorithm , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.1003-1011, March-April-2018.