Estimation for misclassified data with ultra-high levels

  • Kang, Moonsu (Department of Information Statistics, Gangneung-Wonju National University)
  • Received : 2015.08.04
  • Accepted : 2016.01.15
  • Published : 2016.01.31

Abstract

Outcome misclassification is widespread in classification problems, but methods to account for it are rarely used. This paper addresses inference for misclassified multinomial logit data with a large number of multinomial parameters. Interest in developing novel methods for inference with misclassified data has grown considerably. A simulation study shows how serious the misclassification problem becomes as the number of categories increases. Then, using group lasso regression, we show how the best model can be fitted for this kind of multinomial regression problem.
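The effect described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's simulation design and does not use the group lasso. It assumes a hypothetical setup in which each true label is replaced, with probability `error_rate`, by one of the other categories chosen uniformly, and it assumes the misclassification matrix is known when correcting the estimate. The function names, the skewed true distribution, and the default parameters are all illustrative choices.

```python
import numpy as np


def misclassification_matrix(k, error_rate):
    """Row i holds P(observed = j | true = i).

    Assumption: errors are spread evenly over the other k - 1 categories.
    """
    m = np.full((k, k), error_rate / (k - 1))
    np.fill_diagonal(m, 1.0 - error_rate)
    return m


def simulate(k, n=50000, error_rate=0.2, seed=0):
    """Compare naive and corrected estimates of multinomial proportions."""
    rng = np.random.default_rng(seed)

    # Hypothetical skewed true distribution: p_j proportional to j.
    weights = np.arange(1, k + 1, dtype=float)
    p_true = weights / weights.sum()

    true_labels = rng.choice(k, size=n, p=p_true)

    # With probability error_rate, shift the label by 1..k-1 (mod k),
    # which picks one of the other categories uniformly at random.
    flip = rng.random(n) < error_rate
    shift = rng.integers(1, k, size=n)
    observed = np.where(flip, (true_labels + shift) % k, true_labels)

    # Naive estimate ignores misclassification entirely.
    p_naive = np.bincount(observed, minlength=k) / n

    # Since E[p_naive] = M^T p_true, solve the linear system to correct it.
    # Assumption: M is known; in practice it would have to be estimated.
    m = misclassification_matrix(k, error_rate)
    p_corrected = np.linalg.solve(m.T, p_naive)

    def total_variation(p):
        return 0.5 * np.abs(p - p_true).sum()

    return {
        "tv_naive": total_variation(p_naive),
        "tv_corrected": total_variation(p_corrected),
    }


if __name__ == "__main__":
    for k in (3, 5, 10, 20):
        print(k, simulate(k))
```

Running the script prints, for each number of categories, the total variation distance between the true proportions and each estimate. Under these assumptions the naive estimate carries a systematic bias that the matrix correction removes, at the cost of somewhat amplified sampling noise.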

References

  1. Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society B (Methodological), 58, 267-288.
  2. Bross, I. (1954). Misclassification in 2 by 2 tables. Biometrics, 10, 478-486. https://doi.org/10.2307/3001619
  3. Tenenbein, A. (1972). A double sampling scheme for estimating from misclassified multinomial data with application to sampling inspection. Technometrics, 14, 187-202.
  4. Viana, M. A. G. (1994). Bayesian small-sample estimation of misclassified multinomial data. Biometrics, 50, 237-243. https://doi.org/10.2307/2533215
  5. Chen, T. T. (1989). A review of methods for misclassified categorical data in epidemiology. Statistics in Medicine, 8, 1095-1106. https://doi.org/10.1002/sim.4780080908
  6. Ekholm, A. and Palmgren, J. (1987). Correction for misclassification using doubly sampled data. Journal of Official Statistics, 3, 419-429.
  7. Meier, L., van de Geer, S. and Bühlmann, P. (2008). The group lasso for logistic regression. Journal of the Royal Statistical Society B (Statistical Methodology), 70, 53-71. https://doi.org/10.1111/j.1467-9868.2007.00627.x
  8. Songyong, S. and Heemo, K. (2014). A polychotomous regression model with tensor product splines and direct sums. Journal of the Korean Data & Information Science Society, 25, 19-26. https://doi.org/10.7465/jkdi.2014.25.1.19
  9. SangIn, L. (2015). A note on standardization in penalized regressions. Journal of the Korean Data & Information Science Society, 26, 505-516. https://doi.org/10.7465/jkdi.2015.26.2.505