Machine Learning Prediction Algorithm for Chronic Diseases Detection over Bigdata
Keywords:
Big data analytics, Machine Learning, Healthcare.Abstract
With big data growth in biomedical and healthcare communities, accurate analysis of medical data benefits early disease detection, patient care and community services. However, the analysis accuracy is reduced when the quality of medical data is incomplete. Moreover, different regions exhibit unique characteristics of certain regional diseases, which may weaken the prediction of disease outbreaks. In this paper, we streamline machine- learning algorithms for effective prediction of chronic disease outbreak in disease-frequent communities. We experiment the modified prediction models over real-life hospital data collected from central China in 2013- 2015. To overcome the difficulty of incomplete data, we use a latent factor model to reconstruct the missing data. We experiment on a regional chronic disease of cerebral infarction. To the best of our knowledge, none of the existing work focused on both data types in the area of medical big data analytics. Compared to several typical prediction algorithms, the prediction accuracy of our proposed algorithm reaches 94.8% with a convergence speed which is faster than that of the CNN-based unimodal disease risk prediction (CNN-UDRP) algorithm.
References
- A. B. Author, “Title of chapter in the book,” in Title of His Published Book, xth ed. City of Publisher, Country if not
- P. Groves, B. Kayyali, D. Knott, and S. V. Kuiken, “The 'big data' revolution in healthcare: Accelerating value and innovation,” 2016.
- M. Chen, S. Mao, and Y. Liu, “Big data: A survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171–209, 2014.
- S. Bandyopadhyay, J. Wolfson, D. M. Vock, G. Vazquez-Benitez, G. Adomavicius, M. Elidrisi, P. E.Johnson, and P. J. O'Connor, “Data mining for censored qtime-to-event data: a bayesian network model for predicting cardiovascular risk from electronic health record data,” Data Mining and Knowledge Discovery, vol. 29, no. 4, pp. 1033–1069, 2015..
- D. Tian, J. Zhou, Y. Wang, Y. Lu, H. Xia, and Z. Yi, “A dynamic and self-adaptive network selection method for multimode communications in heterogeneous vehicular telematics,” IEEE Transactions on Intelligent Transportation Systems, vol. 16, no. 6, pp. 3033–3049, 2015.
- N. Nori, H. Kashima, K. Yamashita, H. Ikai, and Y. Imanaka, “Simultaneous modeling of multiple diseases for mortality prediction in acute hospital care,” in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2015, pp. 855–864.
- K. Hwang, M. Chen, “Big Data Analytics for Cloud/IoT and Cognitive Computing,” Wiley, U.K., ISBN: 9781119247029, 2017.
- S. M. Chu, W.-T. Shih, Y.-H. Yang, P.-C. Chen, and Y.-H. Chu, “Use of traditional chinese medicine in patients with hyperlipidemia: A population-based study in taiwan,” Journal of ethnopharmacology, vol. 168, pp. 129–135, 2015.
- B. Qian, X. Wang, N. Cao, H. Li, and Y.-G. Jiang, “A relative similarity based method for interactive patient risk prediction,” Data Mining and Knowledge Discovery, vol. 29, no. 4, pp. 1070–1093, 2015.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.