DOI QR코드

DOI QR Code

Usage of auxiliary variable and neural network in doubly robust estimation

  • Park, Hyeonah (Department of Statistics, Seoul National University) ;
  • Park, Wonjun (Department of Statistics, Seoul National University)
  • Received : 2013.04.03
  • Accepted : 2013.05.13
  • Published : 2013.05.31

Abstract

If the regression model or the propensity model is correct, the unbiasedness of the estimator using doubly robust imputation can be guaranteed. Using a neural network instead of a logistic regression model for the propensity model, the estimators using doubly robust imputation are approximately unbiased even though both assumed models fail. We also propose a doubly robust estimator of ratio form using population information of an auxiliary variable. We prove some properties of proposed theory by restricted simulations.

Keywords

References

  1. Carpenter, J. R. and Kenward, M. G. (2006). A comparison of multiple imputation and doubly robust estimation for analyses with missing data. Journal of the Royal Statistical Society A, 169, 571-584. https://doi.org/10.1111/j.1467-985X.2006.00407.x
  2. Cao, W., Tsiatis, A. A. and Davidian, M. (2009). Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data. Biometrika, 96, 723-734. https://doi.org/10.1093/biomet/asp033
  3. Cho, K. H. and Park, H. C. (2012). A study on decision tree creation using marginally conditional variables. Journal of the Korean Data & Information Science Society, 23, 299-307. https://doi.org/10.7465/jkdi.2012.23.2.299
  4. Ekholm, A. and Laaksonen, S. (1991). Weighting via response modeling in the finnish household budget survey. Journal of Official statistics, 7, 325-337.
  5. Groves, R., Dillman, D., Eltinge, J. and Little, R. J. A. (2002). Survey nonresponse, Wiley, New York.
  6. Hastie, T., Tibshirani, R. and Friedman, J. (2009). The element of statistical learning: Data mining, inference, and prediction, 2nd edition, Springer, New York.
  7. Horvitz, D. G. and Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Society, 47, 663-685. https://doi.org/10.1080/01621459.1952.10483446
  8. Iannacchione, V. G. (2003). Sequential weight adjustment for location and cooperation propensity for the 1995 national survey of family growth. Journal of Official statistics, 19, 31-43.
  9. Izenman, A. J. (2008). Modern multivariate statistical techniques: Regression, classification, and manifold learning, Springer, New York.
  10. Kalton, G. and Kasprzyk, D. (1986). The treatment of missing survey data. Survey Methodology, 12, 1-16.
  11. Kim, J. K. and Park, H. (2006). Imputation using response probability. The Canadian Journal of Statistics, 34, 171-182. https://doi.org/10.1002/cjs.5550340112
  12. Little, R. J. A. (1986). Survey nonresponse adjustments for estimates of means. International Statistical Review, 54, 139-157. https://doi.org/10.2307/1403140
  13. Little, R. J. A. and Rubin, D. B. (2002). Statistical analysis with missing data, Wiley, New York.
  14. Park, H., Jeon, J. and Na, S. (2011). Doubly robust imputation using auxiliary information. Communications of the Korean Statistical society, 18, 47-55. https://doi.org/10.5351/CKSS.2011.18.1.047
  15. Qin, J., Shao, J. and Zhang, B. (2008). Efficient and doubly robust imputation for covariate-dependent missing responses. Journal of the American Statistical Association, 103, 797-810. https://doi.org/10.1198/016214508000000238
  16. Rao, J. N. K. and Sitter, R. R. (1995). Variance estimation under two-phase sampling with application to imputation for missing data. Biometrika, 82, 453-460. https://doi.org/10.1093/biomet/82.2.453
  17. Rao, J. N. K. (1996). On variance estimation with imputed survey data. Journal of the American Statistical Association, 91, 499-506. https://doi.org/10.1080/01621459.1996.10476910
  18. Robins, J. M, Rotnitzky, A. and Zhao, L. P. (1994). Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89, 846-866. https://doi.org/10.1080/01621459.1994.10476818
  19. Rosenbaum, P. R. (1987). Model-based direct adjustment. Journal of the American Statistical Association, 82, 387-394. https://doi.org/10.1080/01621459.1987.10478441

Cited by

  1. Estimation to improve survey efficiency in callback vol.26, pp.2, 2015, https://doi.org/10.7465/jkdi.2015.26.2.377
  2. Estimation using response probability when missing data happen on the second occasion vol.25, pp.1, 2014, https://doi.org/10.7465/jkdi.2014.25.1.263