Hierarchical Classification Using a New Hybrid Feature Selection Algorithm
Keywords:
Feature Selection, Hierarchical Single-Label Classification, Variable Neighborhood Search, Filter, Wrapper.Abstract
A common preprocessing step in the data mining industry is feature selection. Reducing the quantity of original dataset characteristics is one of its goals in order to enhance the accuracy of a prediction model. To the best of our knowledge, very few research in the literature address feature selection for the context of hierarchical classification, despite the advantages of feature selection for the classification problem. The general variable neighbourhood search metaheuristic is used to support the innovative feature selection approach that is proposed in this research. The method combines a filter step and a wrapper phase, and a global model hierarchical classifier is used to assess feature subsets. We conducted computational tests to verify the impact of the suggested approach on classification performance while employing two proposed global hierarchical classifiers, using various datasets from the proteins and pictures domains. in the written word. According to statistical testing, our feature selection strategy consistently produced prediction results that were superior to or on par with those achieved by employing all features while using fewer features, which supports its efficacy in the context of hierarchical categorization.
References
- J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed. San Francisco, CA, USA: Morgan Kaufmann, 2011.
- C. N. Silla and A. A. Freitas, ‘‘A survey of hierarchical classification across different application domains,’’ Data Mining Knowl. Discovery, vol. 22, nos. 1–2, pp. 31–72, 2011.
- D. Koller and M. Sahami, ‘‘Hierarchically classifying documents using very few words,’’ in Proc. 14th Int. Conf. Mach. Learn. San Francisco, CA, USA: Morgan Kaufmann, 1997, pp. 170–178.
- A. Secker, M. N. Davies, A. A. Freitas, E. Clark, J. Timmis, and D. R. Flower, ‘‘Hierarchical classification of G-protein- coupled receptors with data-driven selection of attributes and classifiers,’’ Int. J. Data Mining Bioinf., vol. 4, pp. 191–210, Jan. 2010.
- B. C. Paes, A. Plastino, and A. A. Freitas, ‘‘Exploring attribute selection in hierarchical classification,’’ J. Inf. Data Manage., vol. 5, no. 1, pp. 124–133, 2014.
- H. Zhao, P. Zhu, P. Wang, and Q. Hu, ‘‘Hierarchical feature selection with recursive regularization,’’ in Proc. 26th Int. Joint Conf. Artif. Intell., Melbourne, VIC, Australia, 2017, pp. 3483–3489.
- Q. Tuo, H. Zhao, and Q. Hu, ‘‘Hierarchical feature selection with subtree based graph regularization,’’ Knowl.-Based Syst., vol. 163, pp. 996–1008, Jan. 2019.
- H. Huang and H. Liu, ‘‘Feature selection for hierarchical classification via joint semantic and structural information of labels,’’ Knowl.- Based Syst., vol. 195, May 2020, Art. no. 105655.
- I. Slavkov, J. Karcheska, D. Kocev, S. Kalajdziski, and S. Dzeroski, ‘‘ReliefF for hierarchical multi-label classification,’’ in New Frontiers in Mining Complex Patterns (Lecture Notes in Computer Science). Cham, Switzerland: Springer, 2014, pp. 148–161.
- T. N. Dias and L. H. C. Merschmann, ‘‘Adaptation of the symmetric uncertainty measure for feature selection in single-label hierarchical classification context,’’ (in Portuguese), in Proc. Ann. Nat. Meeting Artif. Intell. Comput., Natal, Brazil, 2015, pp. 142– 149.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.