Neural Comp. Sign up for ETOCS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Cherkassky, V.
Right arrow Articles by Ma, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cherkassky, V.
Right arrow Articles by Ma, Y.
(Neural Computation. 2003;15:1691-1714.)
© 2003 The MIT Press


Letter

Comparison of Model Selection for Regression

Vladimir Cherkassky

cherkass{at}ece.umn.edu Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455, U.S.A.

Yunqian Ma

myq{at}ece.umn.edu Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455, U.S.A.

We discuss empirical comparison of analytical methods for model selection. Currently, there is no consensus on the best method for finite-sample estimation problems, even for the simple case of linear estimators. This article presents empirical comparisons between classical statistical methods—Akaike information criterion (AIC) and Bayesian information criterion (BIC)—and the structural risk minimization (SRM) method, based on Vapnik-Chervonenkis (VC) theory, for regression problems. Our study is motivated by empirical comparisons in Hastie, Tibshirani, and Friedman (2001), which claims that the SRM method performs poorly for model selection and suggests that AIC yields superior predictive performance. Hence, we present empirical comparisons for various data sets and different types of estimators (linear, subset selection, and k-nearest neighbor regression). Our results demonstrate the practical advantages of VC-based model selection; it consistently outperforms AIC for all data sets. In our study, SRM and BIC methods show similar predictive performance. This discrepancy (between empirical results obtained using the same data) is caused by methodological drawbacks in Hastie et al. (2001), especially in their loose interpretation and application of SRM method. Hence, we discuss methodological issues important for meaningful comparisons and practical application of SRM method. We also point out the importance of accurate estimation of model complexity (VC-dimension) for empirical comparisons and propose a new practical estimate of model complexity for k-nearest neighbors regression.




This article has been cited by other articles:


Home page
Br J Philos SciHome page
D. L. Dowe, S. Gardner, and G. Oppy
Bayes not Bust! Why Simplicity is no Problem for Bayesians
Brit J Philos Sci, December 1, 2007; 58(4): 709 - 754.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
T. Hastie, R. Tibshirani, and J. Friedman
Note on "Comparison of Model Selection for Regression" by Vladimir Cherkassky and Yunqian Ma
Neural Comput., July 1, 2003; 15(7): 1477 - 1480.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
J COGNITIVE NEUROSCIENCE NEURAL COMPUTATION MIT PRESS JOURNALS
Copyright © 2003 by The MIT Press.