Sequential Genetic Search for Ensemble Feature Selection
Metadata:Show full item record
Citation:Tsymbal, Alexey; Cunningham, Padraig. 'Sequential Genetic Search for Ensemble Feature Selection'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2005-40, 2005, pp6
Ensemble learning constitutes one of the main directions in machine learning and data mining. Ensembles allow us to achieve higher accuracy, which is often not achievable with single models. One technique, which proved to be effective for constructing an ensemble of diverse classifiers, is the use of feature subsets. Among different approaches to ensemble feature selection, genetic search was shown to perform best in many domains. In this paper, a new strategy GAS-SEFS, Genetic Algorithm-based Sequential Search for Ensemble Feature Selection, is introduced. Instead of one genetic process, it employs a series of processes, the goal of each of which is to build one base classifier. Experiments on 21 data sets are conducted, comparing the new strategy with a previously considered genetic strategy for different ensemble sizes and for five different ensemble integration methods. The experiments show that GAS-SEFS, although being more time-consuming, often builds better ensembles, especially on data sets with larger numbers of features.
Science Foundation Ireland
Publisher:Trinity College Dublin, Department of Computer Science
Series/Report no:Computer Science Technical Report