Type of Publication: Journal Articles
Authors: Asaf Shabtai,
Title: ConfDTree: a statistical method for improving decision trees
Name of the Journal: J. Comput. Sci. Technol. (USA)
Year: 2014
Volume: 29
Issue: 3
Pages: 392 - 407
Abstract: Decision trees have three main disadvantages: reduced performance when the training set is small; rigid decision criteria; and the fact that a single “uncharacteristic” attribute might “derail” the classification process. In this paper we present ConfDTree (Confidence-Based Decision Tree) - a post-processing method that enables decision trees to better classify outlier instances. This method, which can be applied to any decision tree algorithm, uses easy-to-implement statistical methods (confidence intervals and two-proportion tests) in order to identify hard-to-classify instances and to propose alternative routes. The experimental study indicates that the proposed post-processing method consistently and significantly improves the predictive performance of decision trees, particularly for small, imbalanced or multi-class datasets in which an average improvement of 5%~9% in the AUC performance is reported.
Keywords: decision trees;pattern classification;statistical analysis; ,
Last Updated: 1/13/2016 12:00:00 AM
Powered by Rami Palombo © 2005
Search in: Google Scholar  |  Scitation