Advanced SearchSearch Tips
Visualizing Multi-Variable Prediction Functions by Segmented k-CPG's
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Visualizing Multi-Variable Prediction Functions by Segmented k-CPG's
Huh, Myung-Hoe;
  PDF(new window)
Machine learning methods such as support vector machines and random forests yield nonparametric prediction functions of the form y = . As a sequel to the previous article (Huh and Lee, 2008) for visualizing nonparametric functions, I propose more sensible graphs for visualizing y = herein which has two clear advantages over the previous simple graphs. New graphs will show a small number of prototype curves of , revealing statistically plausible portion over the interval of which changes with (). To complement the visual display, matching importance measures for each of p predictor variables are produced. The proposed graphs and importance measures are validated in simulated settings and demonstrated for an environmental study.
Visualization of prediction functions;k-Means clustering;variable importance;support vector machine;random forests;environmental data;
 Cited by
Visualizing SVM Classification in Reduced Dimensions,Huh, Myung-Hoe;Park, Hee-Man;

Communications for Statistical Applications and Methods, 2009. vol.16. 5, pp.881-889 crossref(new window)
Breiman, L. (2001). Random forests, Machine Learning, 45, 5-32 crossref(new window)

Breiman, L. and Friedman, J. (1985). Estimating optimal transformations for multiple regression and correlation, Journal of the American Statistical Association, 80, 580-598 crossref(new window)

Hastie, T., Tibshirani, R. and Friedman, J. (2001). The Elements of Statistical Learning, Springer, New York

Huh, M. H. and Lee, Y. (2008). Simple graphs for complex prediction functions, Communications of the Korean Statistical Society, 15, 343-351 crossref(new window)

Strobl, C., Boulesteix, A., Kneib., T., Augustin, T. and Zeileis, A. (2008). Conditioning variable importance for random forests, BMC Bioinformatics, 9, 307 crossref(new window)