DOI QR코드

DOI QR Code

Comparison Study of Time Series Clustering Methods

시계열자료 눈집방법의 비교연구

  • Received : 20091100
  • Accepted : 20091100
  • Published : 2009.12.31

Abstract

In this paper we introduce the time series clustering methods in the time and frequency domains and discuss the merits or demerits of each method. We analyze 15 daily stock prices of KOSPI 200, and the nonparametric method using the wavelet shows the best clustering results. For the clustering of nonstationary time series using the spectral density, the EMD method remove the trend more effectively than the differencing.

본 논문에서는 시계열자료의 군집분석을 위해 시간영역과 진동수영역에서의 군집 방법들을 소개하고 각 방법들의 장단점에 대해 논의하였다. KOSPI 200에 속한 15개 기업의 일별 주가자료률 이용한 비교분석 결과 비모수적인 방법인 웨이블릿을 이용한 군집분석이 가장 좋은 결과를 보였다. 비정상 시계열자료의 경우 차분 보다는 EMD를 이용하여 추세를 제거하는 방법이 스펙트럼 밀도함수를 이용한 군집분석에 더 효율적이었다.

Keywords

References

  1. 김유진 (2009). Comparison study of time series methods, Master thesis, Seoul National University
  2. 박민정 (2009). Time-Frequency analysis by multiscale methods with applications to bat signals, Ph.D thesis, Seoul National University
  3. 이정현 (2008). Clustering analysis of financial time series using wavelet analysis, Master thesis, Seoul National University
  4. Angelini, L., De Carlo, F., Marangi, C., Pellicoro, M. and Stramaglia, S. (2000). Clustering data by inhomo-geneous chaotic map lattices, Physical Review Letters, 85, 554-557 https://doi.org/10.1103/PhysRevLett.85.554
  5. Basalto, N., Bellotti, R., De Carlo, F., Facchi, P. and Pascazio, S. (2005). Clustering stock market companies via chaotic map synchronization, Physica A: Statistical Mechanics and its Applications, 345, 196-206 https://doi.org/10.1016/j.physa.2004.07.034
  6. Blatt, M., Domany, E. and Wiseman, S. (1996). Super-paramagnetic clustering of data, Physical Review Letters, 76, 3251-3254 https://doi.org/10.1103/PhysRevLett.76.3251
  7. Chatfield, C. (1979). Inverse autocorrelations, Journal of Royal Statistical Society Series A, 142, 363-377 https://doi.org/10.2307/2982488
  8. Flandrin, F., Rilling, G. and Goncalves, P. (2004). Empirical mode decomposition as a filter bank, IEEE Signal Processing Letters, 11, 112-114 https://doi.org/10.1109/LSP.2003.821662
  9. Galeano, P. and Pena, D. (2000). Multivariate analysis in vector time series, Resenhas, 4, 383-404
  10. Huang, N. E., Shen, Z., Long, S. R., Wu, M. C., Shih, H. H., Zheng, Q., Yen, N. C., Tung, C. C. and Liu, H. H. (1998). The empirical mode decomposition and Hilbert spectrum for nonlinear and nonstationary time series, Proceedings of the Royal Society London A, 454, 903-995 https://doi.org/10.1098/rspa.1998.0193
  11. Huhtala, Y., Karkkainen, J. and Toivonen, H. (1999). Mining for similarities in aligned time series using wavelets, Data Mining and Knowledge Discovery: Theory, Tools, and Technology, Proceedings of SPIEE, 3695, 150-160 https://doi.org/10.1117/12.339977

Cited by

  1. A study on electricity demand forecasting based on time series clustering in smart grid vol.29, pp.1, 2016, https://doi.org/10.5351/KJAS.2016.29.1.193