Comparative Analysis of Unsupervised Concept Drift Detection Techniques in High-Dimensional Biomedical Data Streams
DOI:
https://doi.org/10.32628/CSEIT25113302Keywords:
Concept Drift Detection, Unsupervised Learning, High-Dimensional Data Streams, AdaBoost, Diversity-Induced Ensemble (DIE), ADWIN, SPRT, Page-Hinkley Test, MIMIC-III, UK Biobank, MedMNIST, Biomedical Data, Real-Time Analytics, Drift Simulation, Ensemble LearningAbstract
In the area of real-time analytics, the ability to detect concept drift shifts in data distribution over time is vital for maintaining the reliability of predictive models. This analysis presents a comprehensive comparative analysis for five Unsupervised Concept Drift Detection Algorithms Adaptive Boosting (AdaBoost), Diversity-Induced Ensemble (DIE), Adaptive Sliding Window (ADWIN), Sequential Probability Ratio Test (SPRT), and Page-Hinkley Test (PHT) with a focus on high-dimensional biomedical data streams. The evaluation is conducted using three large-scale and diverse biomedical datasets: MIMIC-III/IV, UK Biobank, and MedMNIST, each representing a distinct challenge in terms of dimensionality, temporal variability and data type (tabular, genomic, and imaging). Performance is assessed across key metrics including Detection Delay, Memory Usage, Execution Time, and post-drift classification effectiveness (Precision, Recall, F1-Score, and Accuracy). Both synthetic and real-world drifts are incorporated to simulate dynamic environments. The findings reveal that ensemble-based methods such as AdaBoost and DIE outperform statistical approaches in handling noisy, sparse, and high-dimensional streams, offering superior adaptability and robustness. This research contributes a systematic evaluation framework and empirical insights to guide the deployment of unsupervised drift-aware systems in healthcare and other data-intensive domains.
Downloads
References
K. S. Desale and S. Shinde, "Real-Time Concept Drift Detection and Its Application to ECG Data," International Journal of Online and Biomedical Engineering (iJOE), vol. 17, no. 10, pp. 160–170, Oct. 2021. https://doi.org/10.3991/ijoe.v17i10.25473Online Journals
A. R. M. S. Abdul Razak et al., "A novel technique for detecting sudden concept drift in healthcare data using multi-linear artificial intelligence techniques," Frontiers in Artificial Intelligence, vol. 5, Aug. 2022. https://doi.org/10.3389/frai.2022.950659Frontiers
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014), “A survey on concept drifts adaptation”, ACM Computing Surveys (CSUR), 46(4), 1-37.
Gama, J., Medas, P., Castillo, G., & Rodrigues, P. (2004), “ Learning with drift detection”, In SBIA Brazilian Symposium on Artificial Intelligence (pp. 286–295). Springer.
Baena-García, M., del Campo-Ávila, J., Fidalgo, R., Bifet, A., Gavaldà, R., & Morales-Bueno, R. (2006), “Early drift detection method”, In Fourth international workshop on knowledge discovery from data streams (Vol. 6, pp. 77-86).
Brzeziński, D., & Stefanowski, J. (2014), “Reacting to different types of concept drift: The accuracy updated ensemble algorithm”, IEEE Transactions on Neural Networks and Learning Systems, 25(1), 81–94.
A. Razak et al., "A novel technique for detecting sudden concept drift in healthcare data using multi-linear artificial intelligence techniques," Frontiers in Artificial Intelligence, vol. 5, Aug. 2022. [Online]. Available: https://doi.org/10.3389/frai.2022.950659Frontiers+1Frontiers+1
L. Yang and A. Shami, "A Lightweight Concept Drift Detection and Adaptation Framework for IoT Data Streams," arXiv preprint, arXiv: 2104.10529, Apr. 2021. [Online]. Available: https://arxiv.org/abs/2104.10529arXiv
A. K. Gangwar, S. Kumar, and A. Mishra, "A Paired Learner-Based Approach for Concept Drift Detection and Adaptation in Software Defect Prediction," Applied Sciences, vol. 11, no. 14, p. 6663, Jul. 2021. [Online]. Available: https://doi.org/10.3390/app11146663MDPI
J. P. Barddal, H. M. Gomes, and F. Enembreck, "Advances on Concept Drift Detection in Regression Tasks using Social Networks Theory," arXiv preprint, arXiv:2304.09788, Apr. 2023. [Online]. Available: https://arxiv.org/abs/2304.09788arXiv
S. R. Basavappa et al., "AdaBoost based hybrid concept drift detection approach for mental health prediction," in Proc. 15th Int. Conf. Contemporary Computing (IC3), Noida, India, Sep. 2023. [Online]. Available: https://doi.org/10.1145/3607947.3608020ACM Digital Library
Žliobaitė, I. (2010), “Learning under concept drift: an overview”, arXiv preprint arXiv:1010.4784.
Liu, H., & Motoda, H. (2012), “Feature selection for knowledge discovery and data mining”, Springer Science & Business Media.
Johnson, A. E. W., Pollard, T. J., Shen, L., et al. (2016), “MIMIC-III, a freely accessible critical care database”, Scientific Data, 3, 160035.
Bycroft, C., Freeman, C., Petkova, D., et al. (2018), “The UK Biobank resource with deep phenotyping and genomic data,” Nature, 562(7726), 203–209.
Yang, J., Shi, R., Wei, D., et al. (2021), “MedMNIST: A lightweight benchmark for medical image classification”, In Medical Image Analysis, 68, 101552.
Harutyunyan, H., Khachatrian, H., Kale, D. C., Ver Steeg, G., & Galstyan, A. (2019), “ Multitask learning and benchmarking with clinical time series data”, Scientific Data, 6(1), 96.
Tajbakhsh, N., Jeyaseelan, L., Li, Q., Chiang, J. N., Wu, Z., & Ding, X. (2020), “ Embracing imperfect datasets: A review of deep learning solutions for medical image segmentation,” Medical Image Analysis, 63, 101693.
Miotto, R., Wang, F., Wang, S., Jiang, X., & Dudley, J. T. (2018), “Deep learning for healthcare: Review, opportunities and challenges,” Briefings in Bioinformatics, 19(6), 1236–1246.
D. Mulimani et al., "Adaptive Ensemble Learning with Concept Drift Detection for Intrusion Detection," in Data Engineering and Intelligent Computing, vol. 1407, V. Bhateja et al., Eds. Singapore: Springer, 2021, pp. 327–336. [Online]. Available: https://doi.org/10.1007/978-981-16-0171-2_31SpringerLink
L. Yang, D. M. Manias, and A. Shami, "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams," arXiv preprint, arXiv: 2109.05013, Sep. 2021. [Online]. Available: https://arxiv.org/abs/2109.05013arXiv.
L. Baier et al., "Detecting Concept Drift With Neural Network Model Uncertainty," arXiv preprint, arXiv: 2107.01873, Jul. 2021. [Online]. Available: https://arxiv.org/abs/2107.01873arXiv.
T. Yamada and H. Matsutani, "A Sequential Concept Drift Detection Method for On-Device Learning on Low-End Edge Devices," arXiv preprint, arXiv:2212.09637, Dec. 2022. [Online]. Available: https://arxiv.org/abs/2212.09637arXiv.
K. S. Desale and S. Shinde, "Real-Time Concept Drift Detection and Its Application to ECG Data," International Journal of Online and Biomedical Engineering (iJOE), vol. 17, no. 10, pp. 160–170, Oct. 2021. [Online]. Available: https://doi.org/10.3991/ijoe.v17i10.25473Online Journals.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.