Abstract

We propose a tree regularization framework that enables many tree models to perform feature selection efficiently. The key idea of the framework is to penalize selecting a new feature for splitting when its gain (e.g., information gain) is similar to that of the features used in previous splits. We apply the regularization framework to random forests and boosted trees here, and it can easily be applied to other tree models. Experimental studies show that the regularized trees select high-quality feature subsets with regard to both strong and weak classifiers. Because tree models naturally handle categorical and numerical variables, missing values, differing scales between variables, interactions, and nonlinearities, the tree regularization framework provides an effective and efficient feature-selection solution for many practical problems.
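The penalized gain can be stated concretely: with F the set of features already used for splitting and a coefficient λ ∈ (0, 1], a candidate feature receives λ times its gain if it is not in F, and its unpenalized gain otherwise, so a new feature must beat the best already-used feature by a factor of roughly 1/λ to enter the subset. Below is a minimal Python sketch of this node-splitting rule, assuming binary splits on numeric features and information gain as the criterion; the function names and the exhaustive threshold scan are illustrative choices, not the authors' implementation.

```python
import numpy as np

def entropy(y):
    """Shannon entropy of a class-label vector."""
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def information_gain(y, left_mask):
    """Information gain of a binary split defined by a boolean mask."""
    n, n_left = len(y), left_mask.sum()
    if n_left == 0 or n_left == n:  # degenerate split carries no information
        return 0.0
    return (entropy(y)
            - (n_left / n) * entropy(y[left_mask])
            - ((n - n_left) / n) * entropy(y[~left_mask]))

def regularized_split(X, y, used_features, lam=0.8):
    """Pick a split, penalizing features outside `used_features` by `lam`.

    A feature not yet in the selected subset is chosen only if its
    penalized gain (lam * gain) still beats every already-used feature,
    which discourages adding near-duplicate features to the subset.
    """
    best = (None, None, -np.inf)  # (feature index, threshold, gain)
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j])[:-1]:  # candidate split thresholds
            gain = information_gain(y, X[:, j] <= t)
            if j not in used_features:
                gain *= lam  # penalty for enlarging the feature subset
            if gain > best[2]:
                best = (j, t, gain)
    if best[0] is not None:
        used_features.add(best[0])  # the chosen feature joins the subset
    return best[0], best[1]
```

With λ = 1 the penalty vanishes and the rule reduces to the ordinary greedy split; smaller values of λ trade a little per-node gain for a more compact feature subset. The features accumulated in used_features across all nodes (and across all trees, in the regularized random forest and regularized boosted trees) form the selected subset.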

Original language: English (US)
Title of host publication: 2012 International Joint Conference on Neural Networks, IJCNN 2012
State: Published - 2012
Event: 2012 Annual International Joint Conference on Neural Networks, IJCNN 2012, Part of the 2012 IEEE World Congress on Computational Intelligence, WCCI 2012 - Brisbane, QLD, Australia
Duration: Jun 10, 2012 - Jun 15, 2012

Publication series

Name: Proceedings of the International Joint Conference on Neural Networks


Keywords

  • RBoost
  • RRF
  • regularized boosted trees
  • regularized random forest
  • tree regularization

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
