Abstract
Learning a decision tree involves choosing an attribute on which to split the dataset, and the efficiency of the resulting tree depends on this choice. Classical decision tree learning algorithms such as ID3 and CART enumerate all attributes of the original sample, which is time-consuming because the value of the information criterion must be calculated over all objects for every attribute. Previous work showed that using evolutionary algorithms to optimize thresholds in decision tree learning algorithms can significantly speed up the learning process without loss of classification quality. Studies comparing various attribute selection methods have also been conducted and have shown the high efficiency of the Separation Measure method. However, methods combined in an ensemble are known to work more efficiently than individual methods, so this article compares the effectiveness of individual attribute selection methods with that of their ensemble. Because a task can have hundreds of attributes, classical voting methods are not applicable. Therefore, a voting algorithm for attribute selection was developed and implemented. The method was evaluated on several classification problems. Classification accuracy, averaged over all classification tasks, is used as the estimate of the effectiveness of the methods.
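The idea of letting several attribute selection criteria vote on the splitting attribute can be illustrated with a minimal sketch. The abstract does not specify the voting algorithm or the Separation Measure, so the code below substitutes a simple Borda-count rank vote over two standard criteria (entropy-based information gain and Gini impurity reduction) on categorical attributes. The function names `gain` and `borda_vote` are hypothetical and are not taken from the article.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def gain(rows, labels, attr, impurity):
    """Impurity reduction obtained by splitting on categorical attribute `attr`."""
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(row[attr], []).append(y)
    n = len(labels)
    weighted = sum(len(g) / n * impurity(g) for g in groups.values())
    return impurity(labels) - weighted


def borda_vote(rows, labels, criteria):
    """Pick the attribute with the most Borda points across all criteria.

    Assumption: Borda counting stands in for the article's voting scheme.
    Each criterion ranks every attribute; the worst gets 0 points and the
    best gets (number of attributes - 1) points.
    """
    n_attrs = len(rows[0])
    points = [0] * n_attrs
    for impurity in criteria:
        scores = [gain(rows, labels, a, impurity) for a in range(n_attrs)]
        ranked = sorted(range(n_attrs), key=lambda a: scores[a])  # worst first
        for pts, a in enumerate(ranked):
            points[a] += pts
    return max(range(n_attrs), key=lambda a: points[a])
```

Calling `borda_vote(rows, labels, [entropy, gini])` returns the index of the winning attribute. Rank-based voting keeps the criteria comparable even though their raw scores lie on different scales, but this sketch still evaluates every attribute exhaustively and does not attempt the speed-up discussed in the abstract.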
Copyright information
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
About this article
Publication Date
27 February 2023
Article DOI
10.15405/epct.23021.46
eBook ISBN
978-1-80296-960-3
Publisher
European Publisher
Volume
1
Print ISBN (optional)
-
Edition Number
1st Edition
Pages
1-403
Subjects
Hybrid methods, modeling and optimization, complex systems, mathematical models, data mining, computational intelligence
Cite this article as:
Mitrofanov, S. A., & Semenkin, E. S. (2023). Ensembling Methods for Selecting a Splitting Attribute in Decision Trees Learning Algorithms. In P. Stanimorovic, A. A. Stupina, E. Semenkin, & I. V. Kovalev (Eds.), Hybrid Methods of Modeling and Optimization in Complex Systems, vol 1. European Proceedings of Computers and Technology (pp. 372-378). European Publisher. https://doi.org/10.15405/epct.23021.46