DOI QR코드

DOI QR Code

An Alternative Parametric Estimation of Sample Selection Model: An Application to Car Ownership and Car Expense

비정규분포를 이용한 표본선택 모형 추정: 자동차 보유와 유지비용에 관한 실증분석

  • Received : 2011.10.06
  • Accepted : 2012.03.19
  • Published : 2012.05.31

Abstract

In a parametric sample selection model, the distribution assumption is critical to obtain consistent estimates. Conventionally, the normality assumption has been adopted for both error terms in selection and main equations of the model. The normality assumption, however, may excessively restrict the true underlying distribution of the model. This study introduces the $S_U$-normal distribution into the error distribution of a sample selection model. The $S_U$-normal distribution can accommodate a wide range of skewness and kurtosis compared to the normal distribution. It also includes the normal distribution as a limiting distribution. Moreover, the $S_U$-normal distribution can be easily extended to multivariate dimensions. We provide the log-likelihood function and expected value formula based on a bivariate $S_U$-normal distribution in a sample selection model. The results of simulations indicate the $S_U$-normal model outperforms the normal model for the consistency of estimators. As an empirical application, we provide the sample selection model for car ownership and a car expense relationship.

표본선택 모형을 최우추정법으로 추정할 때 오차항의 분포를 제대로 가정하는 것이 매우 중요하다. 표본선택 모형의 선택 방정식과 본 방정식의 오차항 분포를 일반적으로 이변량 정규분포로 가정하지만, 이 가정이 오차항의 실제 분포를 과도하게 제약할 가능성이 있다. 본 연구는 표본선택 모형의 오차항 분포로 $S_U$-정규분포를 도입한다. $S_U$-정규분포는 분포의 비대칭성과 초과첨도를 허용한다는 측면에서 정규분포보다 훨씬 유연하면서, 동시에 정규분포를 극한분포의 형태로 포함하고 있다. 또한 정규분포처럼 다변량 분포함수가 존재하기 때문에 표본선택 모형과 같은 다변량 모형에서도 활용할 수 있다. 본 논문은 $S_U$-정규분포를 이용한 표본선택 모형에서 로그우도 함수와 조건부 기댓값을 도출하고, 시뮬레이션을 통해 정규분포 모형과 추정성과를 비교한다. 또한 자동차 보유 가구들의 자동차 유지비에 관한 실제 데이터를 이용하여 $S_U$-정규분포 표본선택 모형의 추정결과를 제시한다.

Keywords

References

  1. 최필선, 민인식 (2009). Further applications of Johnson's S U-normal distribution to various regression models, Communications of the Korean Statistical Society, 15, 1-11.
  2. Bollerslev, T. (1987). A conditionally heteroskedastic time series model for speculative prices and rates of return, Review of Economics and Statistics, 69, 542-547. https://doi.org/10.2307/1925546
  3. Choi, P. and Min, I. (2009). Estimating endogenous switching regression model with a flexible parametric distribution function: Application to Korean housing demand, Applied Economics, 41, 3045-3055. https://doi.org/10.1080/00036840701335595
  4. Choi, P. and Min, I. (2011). A comparison of conditional and unconditional approaches in Value-at-Risk estimation, Japanese Economic Review, 62, 99-115. https://doi.org/10.1111/j.1468-5876.2010.00456.x
  5. Heckman, J. (1979). Sample selection bias as a specification error, Econometrica, 47, 153-161. https://doi.org/10.2307/1912352
  6. Johnson, N. (1949a). Systems of frequency curves generated by method of translation, Biometrika, 36, 149-176. https://doi.org/10.1093/biomet/36.1-2.149
  7. Johnson, N. (1949b). Bivariate distributions based on simple translation systems, Biometrika, 36, 297-304. https://doi.org/10.1093/biomet/36.3-4.297
  8. Klaauw, B. and R. Koning (2003). Testing the normality assumption in the sample selection model with an application to travel demand, Journal of Business and Economic Statistics, 21, 31-42. https://doi.org/10.1198/073500102288618739
  9. Manski, C. (1989). Anatomy of the selection problem, Journal of Human Resources, 24, 343-360. https://doi.org/10.2307/145818
  10. Vythoulkas, P. (2007). Car ownership and household transport expenditure in Greece, Working paper, National Technical University of Athens.
  11. Wooldridge, J. (2002). Econometric Analysis of Cross Section and Panel Data, MIT Press, London, England.