Risk Prediction for Coronary Heart Disease Using C5.0 Decision Tree Algorithm

Authors(1) :-M. Nivedhika

Heart Disease is now-a-days one of the most leading causes of death rate in the worlds. Here we aim to explain the CHD predictive model using C5.0 one of the DT algorithm. In a every single day there are huge amount of data were providing in the hospitals. For making a correct decision for the disease some of the hidden information are not mined properly or effectively. So, here we are using some DM techniques which can make a solution to this situation. This work has been developed using the DM techniques namely DT (CART, C5.0) algorithms and classification (Naive Bayes, Neural Network) algorithms. Result shows each technique has its own specific strength in realizing the objective of the defined mining goals. Using attributes (age,sex,cp, trestbps, chol, fbs, restecg, thalach, exang, oldpeak, slope, thal, ca, num) it can predict the likelihood of patients getting heart disease. In this paper we studied and validated the predictive power of DM algorithms by comparing the performance of C5.0 with two classifier algorithm Naive Bayes, Neural Network and one DT algorithms CART. At finally we are comparing it Accuracy, Sensitivity, Specificity with two different types of classification algorithm and one DT algorithm. Contrary to the former study, the C5.0 algorithm performed best than the other two classifiers algorithm and one DT algorithm in predicting CHD, and C5.0 have the highest predictive power. This paper provides an insight about C5.0 DT algorithm used to predict the heart diseases.

Authors and Affiliations

M. Nivedhika
Computer Science Department, Pondicherry University, Pondicherry, India

Decision Tree(DT), Data Mining(DM) and C5.0 algorithm

  1. Ding K, Bailey KR, Kullo IJ. Genotype-informed estimation of risk of coronary heart disease based on genome-wide association data linked to the electronic medical record. BMC cardiovascular disorders. 2011 Dec;11(1):66.
  2. Janssen DJ, Wouters EF, Schols JM, Spruit MA. Self-perceived symptoms and care needs of patients with severe to very severe chronic obstructive pulmonary disease, congestive heart failure or chronic renal failure and its consequences for their closest relatives: the research protocol. BMC Palliative Care. 2008 Dec;7(1):5.
  3. Lu SE, Beckles GL, Crosson JC, Bilik D, Karter AJ, Gerzoff RB, Lin Y, Ross SV, McEwen LN, Waitzfelder BE, Marrero D. Evaluation of risk equations for prediction of short-term coronary heart disease events in patients with long-standing type 2 diabetes: the Translating Research into Action for Diabetes (TRIAD) study. BMC endocrine disorders. 2012 Dec;12(1):12.
  4. Eom JH, Kim SC, Zhang BT. AptaCDSS-E: A classifier ensemble-based clinical decision support system for cardiovascular disease level prediction. Expert Systems with Applications. 2008 May 1;34(4):2465-79.
  5. Paredes S, Rocha T, De Carvalho P, Henriques J, Harris M, Morais J. Long term cardiovascular risk models’ combination. Computer methods and programs in biomedicine. 2011 Mar 1;101(3):231-42.
  6. Shilaskar S, Ghatol A. Feature selection for medical diagnosis: Evaluation for cardiovascular diseases. Expert Systems with Applications. 2013 Aug 1;40(10):4146-53.
  7. Nambiar R, Bhardwaj R, Sethi A, Vargheese R. A look at challenges and opportunities of big data analytics in healthcare. InBig Data, 2013 IEEE International Conference on 2013 Oct 6 (pp. 17-22). IEEE.
  8. Gandhi M, Singh SN. Predictions in heart disease using techniques of data mining. InFuturistic Trends on Computational Analysis and Knowledge Management (ABLAZE), 2015 International Conference on 2015 Feb 25 (pp. 520-525). IEEE.
  9. Theresa Princy. R and J. Thomas, "Human Heart Disease Prediction System using DM Techniques", 2016 International Conference on Circuit, Power and Computing Technologies.
  10. Sudeshna P, Bhanumathi S, Hamlin MA. Identifying symptoms and treatment for heart disease from biomedical literature using text data mining. InComputation of Power, Energy Information and Commuincation (ICCPEIC), 2017 International Conference on 2017 Mar 22 (pp. 170-174). IEEE.
  11. Pang SL, Gong JZ. C5. 0 classification algorithm and application on individual credit evaluation of banks. Systems Engineering-Theory & Practice. 2009 Dec 1;29(12):94-104.

Publication Details

Published in : Volume 3 | Issue 3 | March-April 2018
Date of Publication : 2018-04-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 1003-1011
Manuscript Number : CSEIT1833341
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

M. Nivedhika, "Risk Prediction for Coronary Heart Disease Using C5.0 Decision Tree Algorithm ", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.1003-1011, March-April-2018.
Journal URL : http://ijsrcseit.com/CSEIT1833341

Article Preview

Follow Us

Contact Us