Online Ensemble Learning of Data Streams with Gradually Evolved

Authors

  • A. Auysha  Department of M.Sc(Software Engineering), PSN College of Engineering & Technology, Tirunelveli, Tamilnadu, India
  • Dr. A. Jayachandran  Department of M.Sc(Software Engineering), PSN College of Engineering & Technology, Tirunelveli, Tamilnadu, India

Keywords:

Class-based ensemble for class evolution, class incremental learning, Drift detection method, Machine learning repository, Twitter crawl dataset and chunk-by-chunk.

Abstract

Class evolution is an important research topic for data stream mining. All previous studies implicitly regard class evolution as a transient change, which is not true for many real-world problems. This paper concerns the scenario where classes emerge or disappear gradually. A class-based ensemble approach, namely class-based ensemble for class evolution.Emprical studies demonstrate the effectiveness of CBCE in various class evolution scenarios in comparison to existing class evolution adaptation methods. With the rapid development of incremental learning and online,learning,mining tasks in the context of data stream have been widely studied.Generally,data stream mining refers to the mining task that are conducted on a sequence of rapidly arriving data records. As the environment where the data are collected may change dynamically, the data distribution may also change accordingly. This phenomenon referred to as concept drift is one of the most important challenges in data stream mining. A data stream mining technique should be capable of constructing and dynamically updating a model in order to learn dynamic changes.

References

  1. M. M. Gaber, A. Zaslavsky, and S. Krishnaswamy, "Mining data streams: A review," SIGMOD Rec., vol. 34, no. 2, pp. 18–26, 2005.
  2. P. Domingos and G. Hulten, "Mining high-speed data streams," in Proc. 6th ACM SIGKDD Int. Conf. Know. Discovery Data Mining, 2000, pp. 71–80.
  3. J. Gama, I. Zliobait e, A. Bifet, M. Pechenizkiy, and A. Bouchachia, "A survey on concept drift adaptation," ACM Compute. Surv. vol. 46, no. 4, pp. 44:1–44:37, Mar. 2014.
  4. L. Minku, A. White, and X. Yao, "The impact of diversity on online ensemble learning in the presence of concept drifts," IEEE Trans. Know. Data Eng., vol. 22, no. 5, pp. 730–742, May 2010.
  5. L. Minku and X. Yao, "DDD: A new ensemble approach for dealing with concept drift," IEEE Trans. Know. Data Eng., vol. 24, no. 4, pp. 619–633, Apr. 2012.
  6. A. Bifet, G. Holmes, B. Pfahringer, R. Kirkby, and R. Gavalda, "New ensemble methods for evolving data streams," in Proc. 15th ACM SIGKDD Int. Conf. Know. Discovery Data Mining, 2009,pp. 139–148.
  7. J. Liu, X. Li, and W. Zhong, "Ambiguous decision trees for mining concept-drifting data streams," Pattern Recog. Lett. vol. 30, no. 15,pp. 1347–1355, 2009.
  8. Z.-H. Zhou and Z.-Q. Chen, "Hybrid decision tree," Know.-Based Syst., vol. 15, no. 8, pp. 515–528, 2002.
  9. M. Masud, J. Gao, L. Khan, J. Han, and B. Thuraisingham,"Integrating novel class detection with classification for concept drifting data streams," in Proc. Eur. Conf. Mach. Learn. Know. Discovery Databases, 2009, vol. 5782, pp. 79–94.
  10. M. Masud, Q. Chen, L. Khan, C. Aggarwal, J. Gao, J. Han, and B. Thuraisingham, "Addressing concept-evolution in concept drifting data streams," in Proc. IEEE 10th Int. Conf. Data Mining, Dec. 2010, pp. 929–934.

Downloads

Published

2017-04-30

Issue

Section

Research Articles

How to Cite

[1]
A. Auysha, Dr. A. Jayachandran, " Online Ensemble Learning of Data Streams with Gradually Evolved, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 2, pp.98-102, March-April-2017.