|Title||Robust model selection using fast and robust bootstrap|
|Publication Type||Journal Article|
|Year of Publication||2008|
|Authors||Salibian-Barrera, M, Van Aelst, S|
|Journal||COMPUTATIONAL STATISTICS & DATA ANALYSIS|
|Date Published||AUG 15|
|Type of Article||Article|
Robust model selection procedures control the undue influence that outliers can have on the selection criteria by using both robust point estimators and a bounded loss function when measuring either the goodness-of-fit or the expected prediction error of each model. Furthermore, to avoid favoring over-fitting models, these two measures can be combined with a penalty term for the size of the model. The expected prediction error conditional on the observed data may be estimated using the bootstrap. However, bootstrapping robust estimators becomes extremely time consuming on moderate- to high-dimensional data sets. It is shown that the expected prediction error can be estimated using a very fast and robust bootstrap method, and that this approach yields a consistent model selection method that is computationally feasible even for a relatively large number of covariates. Moreover, as opposed to other bootstrap methods, this proposal avoids the numerical problems associated with the small bootstrap samples required to obtain consistent model selection criteria. The finite-sample performance of the fast and robust bootstrap model selection method is investigated through a simulation study, while its feasibility and good performance on moderately large regression models are illustrated on several real data examples. (c) 2008 Elsevier B.V. All rights reserved.
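As a rough illustration of the first idea in the abstract (a robust point estimator, a bounded loss, and a penalty on model size), the following Python sketch scores candidate regression models on simulated data. It is not the authors' procedure and does not implement the fast and robust bootstrap: the IRLS M-estimator started from least squares, the Tukey bisquare loss with tuning constant 4.685, the MAD scale, the `log(n)` penalty, and all function names are assumptions chosen for the example.

```python
import itertools
import numpy as np

C = 4.685  # bisquare tuning constant (a common default; an assumption here)

def tukey_rho(u, c=C):
    """Tukey bisquare loss: roughly u**2 / 2 near zero, capped at c**2 / 6."""
    v = np.clip(np.abs(np.asarray(u, dtype=float)) / c, 0.0, 1.0)
    return (c**2 / 6.0) * (1.0 - (1.0 - v**2) ** 3)

def mad_scale(r):
    """Median absolute deviation, rescaled for consistency at the normal."""
    return 1.4826 * np.median(np.abs(r - np.median(r)))

def m_fit(X, y, c=C, n_iter=50, tol=1e-8):
    """Simple IRLS M-estimate with bisquare weights (illustrative only)."""
    beta = np.linalg.lstsq(X, y, rcond=None)[0]  # least-squares start
    for _ in range(n_iter):
        r = y - X @ beta
        s = max(mad_scale(r), 1e-12)
        v = np.clip(np.abs(r / s) / c, 0.0, 1.0)
        sw = np.sqrt((1.0 - v**2) ** 2)  # square roots of bisquare weights
        new = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
        converged = np.max(np.abs(new - beta)) < tol
        beta = new
        if converged:
            break
    return beta

def robust_criterion(X, y, cols, sigma, penalty):
    """Bounded-loss goodness of fit plus a penalty on the model size."""
    Xs = X[:, cols]
    r = y - Xs @ m_fit(Xs, y)
    return tukey_rho(r / sigma).sum() + penalty * len(cols)

def select_model(X, y):
    """Score every submodel containing the intercept (column 0)."""
    n, p = X.shape
    sigma = mad_scale(y - X @ m_fit(X, y))  # common scale from the full model
    penalty = np.log(n)                     # hypothetical BIC-like penalty
    best, best_val = None, np.inf
    for k in range(p):
        for extra in itertools.combinations(range(1, p), k):
            cols = [0, *extra]
            val = robust_criterion(X, y, cols, sigma, penalty)
            if val < best_val:
                best, best_val = cols, val
    return best, best_val

# Simulated data: only the first covariate matters; 10% of responses are outliers.
rng = np.random.default_rng(0)
n = 200
Z = rng.normal(size=(n, 3))
X = np.column_stack([np.ones(n), Z])
y = 1.0 + 3.0 * Z[:, 0] + rng.normal(size=n)
y[:20] += 20.0

best, best_val = select_model(X, y)
print("selected columns:", best)
```

Because the loss is capped, each outlying response can add at most `C**2 / 6` to the criterion, so the contaminated observations cannot dominate the comparison between models. Estimating the expected prediction error by bootstrap, as the abstract describes, would replace the in-sample loss used here.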