Gradual Class Evolution Detection Using Class Based Ensembles

Authors

  • J Linita Lyle  Department of Computer Science and Engineering, M A College of Engineering Kothamangalam, Kerala, India
  • Soumya Balan P  Department of Computer Science and Engineering, M A College of Engineering Kothamangalam, Kerala, India
  • Prof. Leya Elizabeth Sunny  Department of Computer Science and Engineering, M A College of Engineering Kothamangalam, Kerala, India

Keywords:

Data Stream Mining, Concept Drift, Class Evolution

Abstract

Recent advances in hardware and software have enabled the capture of measurements in a wide range of fields. These measurements are generated continuously and at high, fluctuating data rates. Mining data streams is concerned with extracting knowledge structures, represented as models and patterns, from non-stopping streams of information. Research in data stream mining has attracted considerable attention because of the importance of its applications and the increasing generation of streaming information. Classifying data streams poses greater challenges than classifying static data because of several properties unique to data streams, such as infinite length, concept drift, concept evolution and feature evolution. While extensive work has been done on concept drift, concept evolution, a phenomenon that induces concept drift, has gained little recognition. Class evolution covers three aspects, namely the emergence, disappearance and reoccurrence of classes, and is an important research topic in data stream mining. Most previous works implicitly regard class evolution as a transient change, which does not hold for many real-world applications, where class evolution is a gradual process. A class-based ensemble approach, namely Class-Based ensemble for Class Evolution (CBCE), is adopted to handle class evolution. By maintaining a base learner for each class and dynamically updating the base learners with new data, CBCE can rapidly adjust to class evolution. A novel under-sampling method for the base learners is used to handle the dynamic class-imbalance problem caused by the gradual evolution of classes.
Based on the above concepts of gradual class evolution, a dataset of tweets posted on Twitter is evaluated at different timestamps by converting the unstructured, dynamic dataset into a more compact form in order to analyse concept evolution.
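The idea of keeping one base learner per class and under-sampling the data each learner sees can be illustrated with a minimal Python sketch. This is not the authors' CBCE implementation. The names `BinaryLearner` and `ClassBasedEnsemble`, the choice of online logistic regression as base learner, the time-decayed class prior, the `min_prior` threshold and the rule for keeping negative examples are all assumptions made for illustration only.

```python
import math
import random


class BinaryLearner:
    """Online logistic regression (illustrative base learner, an assumption)."""

    def __init__(self, n_features, lr=0.5):
        self.w = [0.0] * n_features
        self.b = 0.0
        self.lr = lr

    def score(self, x):
        z = self.b + sum(wi * xi for wi, xi in zip(self.w, x))
        z = max(-30.0, min(30.0, z))  # clamp to keep exp() well behaved
        return 1.0 / (1.0 + math.exp(-z))

    def update(self, x, target):
        err = target - self.score(x)
        self.w = [wi + self.lr * err * xi for wi, xi in zip(self.w, x)]
        self.b += self.lr * err


class ClassBasedEnsemble:
    """Hypothetical one-learner-per-class ensemble for a stream with evolving classes."""

    def __init__(self, n_features, decay=0.9, min_prior=0.01, seed=0):
        self.n_features = n_features
        self.decay = decay          # forgetting factor for the class priors
        self.min_prior = min_prior  # below this a class is treated as disappeared
        self.learners = {}
        self.prior = {}             # time-decayed frequency estimate per class
        self.rng = random.Random(seed)

    def active_classes(self):
        return [c for c, p in self.prior.items() if p >= self.min_prior]

    def learn_one(self, x, y):
        if y not in self.learners:  # class emergence: create a new base learner
            self.learners[y] = BinaryLearner(self.n_features)
            self.prior[y] = 0.0
        for c in self.prior:
            seen = 1.0 if c == y else 0.0
            self.prior[c] = self.decay * self.prior[c] + (1.0 - self.decay) * seen
        for c, learner in self.learners.items():
            if c == y:
                learner.update(x, 1.0)
                continue
            # Under-sample negatives so a rare class is not swamped by them.
            p = self.prior[c]
            keep = 1.0 if p >= 0.5 else p / (1.0 - p)
            if self.rng.random() < keep:
                learner.update(x, 0.0)

    def predict_one(self, x):
        # Learners of disappeared classes are kept but ignored until the
        # class reoccurs and its prior rises above the threshold again.
        candidates = self.active_classes() or list(self.learners)
        if not candidates:
            return None
        return max(candidates, key=lambda c: self.learners[c].score(x))


if __name__ == "__main__":
    model = ClassBasedEnsemble(n_features=2)
    for _ in range(100):
        model.learn_one([1.0, 0.0], "a")
        model.learn_one([0.0, 1.0], "b")
    print(model.predict_one([1.0, 0.0]), model.active_classes())
```

In this sketch a class that stops appearing is not deleted: its prior decays until the class is excluded from prediction, and a single new example of that class raises the prior enough to reactivate the stored learner. This mirrors the emergence, disappearance and reoccurrence aspects described in the abstract, under the stated assumptions.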

Published

2017-12-31

Section

Research Articles

How to Cite

[1]
J Linita Lyle, Soumya Balan P, Prof. Leya Elizabeth Sunny, "Gradual Class Evolution Detection Using Class Based Ensembles," International Journal of Scientific Research in Science, Engineering and Technology (IJSRSET), Print ISSN: 2395-1990, Online ISSN: 2394-4099, Volume 3, Issue 7, pp. 26-31, September-2017.