DOI QR코드

DOI QR Code

A comparison of imputation methods for the consecutive missing temperature data

연속적 결측이 존재하는 기온 자료에 대한 결측복원 기법의 비교

  • Received : 2016.03.23
  • Accepted : 2016.03.31
  • Published : 2016.04.30

Abstract

Consecutive missing values are likely to occur in long climate data due to system error or defective equipment. Furthermore, it is difficult to impute missing values. However, these complicated problems can be overcame by imputing missing values with reference time series. Reference time series must be composed of similar time series to time series that include missing values. We performed a simulation to compare three missing imputation methods (the adjusted normal ratio method, the regression method and the IDW method) to complete the missing values of time series. A comparison of the three missing imputation methods for the daily mean temperatures at 14 climatological stations indicated that the IDW method was better thanx others at south seaside stations. We also found the regression method was better than others at most stations (except south seaside stations).

Keywords

consecutive missing value;missing value imputation;adjusted normal ratio methods;regression method;IDW method

References

  1. Azman, M. A., Zakaria, R., and Radi, N. F. A. (2015). Estimation of missing rainfall data in Pahang using modified spatial interpolation weighting methods, The 2nd ISM International Statistical Conferencd 2014 (ISM-II): Empowering the Applications of Statistical and Mathematical Sciences, 1643, 65-72
  2. Di Piazza, A., Lo Conti, F., Noto, L. V., Viola, F., and La Loggia, G. (2011). Comparative analysis of different techniques for spatial interpolation of rainfall data to create a serially complete monthly time series of precipitation for Sicily, Italy, International Journal of Applied Earth Observation and Geoinformation, 13, 396-408. https://doi.org/10.1016/j.jag.2011.01.005
  3. Durre, I., Menne, M. J., Gleason, B. E., Houston, T. G., and Vose, R. S. (2010). Comprehensive automated quality assurance of daily surface observations, National Climatic Data Center, 49, 1615-1633.
  4. Jung, S.-Y. (2014). A study of consecutive missing value imputation method using reference series in time series, M.S. Thesis, Department of Statistics, Graduate School of Dongguk University, Seoul, Korea.
  5. Lee, Y.-S. (2003). Data Mining Cookbook by Olivia Parr Rud, Kyowoo Publishing Company, Seoul.
  6. Legates, D. R., and McCabe Jr., G. J. (1999). Evaluating the use of "goodness-of -fit" measures in hydrologic and hydroclimatic model evaluation, Water Resources Research, 35, 233-241. https://doi.org/10.1029/1998WR900018
  7. Paulhus, J. L. H. and Kohler, M. A. (1952). Interpolation of missing precipitation records, Monthly Weather Review, 80, 129-133. https://doi.org/10.1175/1520-0493(1952)080<0129:IOMPR>2.0.CO;2
  8. Teegavarapu, R. S. V. and Chandramouli, V. (2005). Improved weighting methods, deterministic and stochastic data-driven models for estimation of missing precipitation records, Journal of Hydrology, 312, 191-206. https://doi.org/10.1016/j.jhydrol.2005.02.015
  9. You, J. S., Hubbard, K. G., and Goddard, S. (2008). Comparison of methods for spatially estimating station temperatures in a quality control system, International Journal of Climatology, 28, 777-787. https://doi.org/10.1002/joc.1571
  10. Young, K. (1992). A three-way model for interpolating for monthly precipitation values, Monthly Weather Review, 120, 2561-2569. https://doi.org/10.1175/1520-0493(1992)120<2561:ATWMFI>2.0.CO;2

Acknowledgement

Supported by : Korea Meteorological Administration