Mining Sequential Risk Patterns for Early Assessment of COPD

Authors

  • Saniya PK  Computer Science and Engineering, MEA Engineering College, Perinthalmanna, Kerala, India
  • Bineesh V.  Computer Science and Engineering, MEA Engineering College, Perinthalmanna, Kerala, India

DOI:

https://doi.org//10.32628/CSEIT1953165

Keywords:

COPD, Sequential pattern, SPADE, CBS, Decision tree, Classification

Abstract

Nowadays, chronic diseases have been among the major concerns in medical fields since they may cause heavy burden on healthcare resources and disturb the quality of life. Chronic Obstructive Pulmonary Disease (COPD) is one kind of popular chronic diseases. COPD takes long period to evolve from mild symptoms (Stage I) to severe illness (Stage IV) and death. Earlier the disease is detected, the better the scope for effective treatment and improved control of symptom development. Therefore, the early of COPD is beneficial for better treatment. The paper examines the novel system for early appraisal on chronic illnesses by mining sequential risk patterns with interim data from diagnostic clinical records utilizing sequential rules mining and classification modelling systems. The system consists of four phases namely data pre-processing, risk pattern mining, classification modelling. SPADE algorithm and CBS algorithm used for risk pattern mining and classification. Decision tree algorithm is compared with the SPADE algorithm, and SPADE showing a better accuracy when comparing with decision tree.

References

  1. Asha, T., Natarajan, S., and Murthy, K. B. (2011). Associative classification in the prediction of tuberculosis. In Proceedings of the International Conference & Workshop on Emerging Trends in Technology, pages 1327–1330. ACM.
  2. Cheng, Y.-T., Lin, Y.-F., Chiang, K.-H., and Tseng, V. S. (2017). Mining sequential risk patterns from large-scale clinical databases for early assessment of chronic diseases: a case study on chronic obstructive pulmonary disease. IEEE journal of biomedical and health informatics, 21(2):303–311.
  3. Chin, C. Y., Weng, M. Y., Lin, T. C., Cheng, S. Y., Yang, Y. H. K., and Tseng, V. S. (2015). Mining disease risk patterns from nationwide clinical databases for the assessment of early rheumatoid arthritis risk. PloS one, 10(4):e0122508.
  4. Soni, J., Ansari, U., Sharma, D., and Soni, S. (2011). Predictive data mining for med- ical diagnosis: An overview of heart disease prediction. International Journal of Computer Applications, 17(8):43–48.
  5. Tseng, V. S. and Lee, C.-H. (2009). Effective temporal data classification by integrating sequential pattern mining and probabilistic induction. Expert Systems with Applications, 36(5):9524–9532.
  6. Zaki, M. J. (1998). Efficient enumeration of frequent sequences. In Proceedings of the seventh international conference on Information and knowledge management, pages 68–75. ACM.
  7. Han, Jiawei, Micheline Kamber, and Data Mining. "Concepts and techniques." Morgan Kaufmann 340 (2001): 94104-3205.
  8. Safavian, S. Rasoul, and David Landgrebe. "A survey of decision tree classifier methodology." IEEE transactions on systems, man, and cybernetics 21.3 (1991): 660-674.

Downloads

Published

2019-06-30

Issue

Section

Research Articles

How to Cite

[1]
Saniya PK, Bineesh V., " Mining Sequential Risk Patterns for Early Assessment of COPD, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 5, Issue 3, pp.496-502, May-June-2019. Available at doi : https://doi.org/10.32628/CSEIT1953165