Classification of Time-Series Data Based on Several Lag Windows
Kim, Hee-Young; Park, Man-Sik
In time-series analysis, it is often more convenient to work in the frequency domain than in the time domain. The spectral density, which describes the autocorrelation structure of a time-series process, is the core of frequency-domain analysis. It can be estimated by computing a periodogram or by averaging the periodogram over adjacent frequencies with equal or unequal weights. Such an estimate is an attractive tool for measuring the similarity between time-series processes. We employ the metrics based on a smoothed periodogram proposed by Park and Kim (2008) to classify different classes of time-series processes. Instead of the modified Daniell window used in Park and Kim (2008), we consider several lag windows with unequal weights. We evaluate the performance under various simulation scenarios. The simulation results show that the metrics used in this study split the time series into the preassigned clusters better than the raw-periodogram-based metrics proposed by Caiado et al. (2006). We also apply our metrics to an economic time-series dataset.
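To illustrate the idea, the following is a minimal Python sketch, not the authors' implementation. It assumes the textbook lag-window estimator f(w) = (1/2π) Σ_{|k|≤M} λ(k/M) γ(k) cos(wk), uses the Bartlett and Parzen windows as two examples of lag windows with unequal weights, and compares two series by the Euclidean distance between their log-normalized spectral estimates. The function names, the truncation point M = 2√n, and the particular distance are assumptions made for illustration only; the abstract does not state which windows or metric details the paper uses.

```python
import math
import random


def autocovariance(x, max_lag):
    """Biased sample autocovariances gamma(0), ..., gamma(max_lag)."""
    n = len(x)
    mean = sum(x) / n
    d = [v - mean for v in x]
    return [sum(d[t] * d[t + k] for t in range(n - k)) / n
            for k in range(max_lag + 1)]


def bartlett(u):
    """Bartlett (triangular) lag window."""
    return max(0.0, 1.0 - abs(u))


def parzen(u):
    """Parzen lag window."""
    a = abs(u)
    if a <= 0.5:
        return 1.0 - 6.0 * a ** 2 + 6.0 * a ** 3
    if a <= 1.0:
        return 2.0 * (1.0 - a) ** 3
    return 0.0


def lag_window_spectrum(x, window=parzen, M=None):
    """Variance-normalized lag-window estimate at the Fourier frequencies
    2*pi*j/n, j = 1, ..., n//2. M defaults to 2*sqrt(n) (an assumption)."""
    n = len(x)
    if M is None:
        M = int(2 * math.sqrt(n))
    gamma = autocovariance(x, M)
    spec = []
    for j in range(1, n // 2 + 1):
        w = 2.0 * math.pi * j / n
        s = gamma[0] + 2.0 * sum(window(k / M) * gamma[k] * math.cos(w * k)
                                 for k in range(1, M + 1))
        s /= 2.0 * math.pi * gamma[0]   # divide by the variance: scale-free
        spec.append(max(s, 1e-12))      # guard against rounding below zero
    return spec


def log_spectrum_distance(x, y, window=parzen, M=None):
    """Euclidean distance between log smoothed spectra (illustrative metric)."""
    if len(x) != len(y):
        raise ValueError("series must have equal length")
    fx = lag_window_spectrum(x, window, M)
    fy = lag_window_spectrum(y, window, M)
    return math.sqrt(sum((math.log(a) - math.log(b)) ** 2
                         for a, b in zip(fx, fy)))


def simulate_ar1(phi, n, seed):
    """Simulate an AR(1) series with Gaussian noise and a 50-step burn-in."""
    rng = random.Random(seed)
    x, prev = [], 0.0
    for _ in range(n + 50):
        prev = phi * prev + rng.gauss(0.0, 1.0)
        x.append(prev)
    return x[50:]


if __name__ == "__main__":
    a1 = simulate_ar1(0.8, 200, seed=1)
    a2 = simulate_ar1(0.8, 200, seed=2)
    b1 = simulate_ar1(-0.8, 200, seed=3)
    for name, win in (("Bartlett", bartlett), ("Parzen", parzen)):
        same = log_spectrum_distance(a1, a2, win)
        diff = log_spectrum_distance(a1, b1, win)
        print(name, "same process:", round(same, 2), "different:", round(diff, 2))
```

In this sketch, two realizations of the same AR(1) process should lie closer together than realizations of processes with opposite-signed coefficients. A matrix of such pairwise distances could then be passed to a hierarchical clustering routine. Swapping the `window` argument is how one would compare different lag windows.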
Keywords: clustering; autoregressive model; moving-average model; smoothed periodogram; nonstationary time series
 References
Baker, F. B. and Hubert, L. J. (1975). Measuring the power of hierarchical cluster analysis, Journal of the American Statistical Association, 70, 31-38.

Bohte, Z., Cepar, D. and Kosmelj, K. (1980). Clustering of time series, In Proceedings of COMPSTAT, 587-593.

Brillinger, D. (1981). Time Series: Data Analysis and Theory, Holden-Day, San Francisco.

Brockwell, P. J. and Davis, R. A. (1991). Time Series: Theory and Methods, Springer-Verlag, New York.

Caiado, J., Crato, N. and Pena, D. (2006). A periodogram-based metric for time series classification, Computational Statistics and Data Analysis, 50, 2668-2684.

Chatfield, C. (1975). The Analysis of Time Series: Theory and Practice, Chapman & Hall, London.

Chen, G., Abraham, B. and Peiris, S. (1994). Lag window estimation of the degree of differencing in fractionally integrated time series models, Journal of Time Series Analysis, 15, 473-487.

Corduas, M. and Piccolo, D. (2008). Time series clustering and classification by the autoregressive metric, Computational Statistics and Data Analysis, 52, 1860-1872.

Cowpertwait, P. S. P. and Cox, T. F. (1992). Clustering population means under heterogeneity of variance with an application to a rainfall time series problem, The Statistician, 41, 113-121.

Galeano, P. and Pena, D. (2000). Multivariate analysis in vector time series, Resenhas, 4, 383-403.

Golay, X., Kollias, S., Stoll, G., Meier, D., Valavanis, A. and Boesiger, P. (1998). A new correlation-based fuzzy logic clustering algorithm for fMRI, Magnetic Resonance in Medicine, 40, 249-260.

Goutte, C., Toft, P., Rostrup, E., Nielsen, F. A. and Hansen, L. K. (1999). On clustering fMRI time series, Neuroimage, 9, 298-310.

Kakizawa, Y., Shumway, R. H. and Taniguchi, M. (1998). Discrimination and clustering for multivariate time series, Journal of the American Statistical Association, 93, 328-340.

Kovacic, Z. J. (1996). Classification of time series with applications to the leading indicator selection, In Proceedings of the Fifth Conference of IFCS, 2, 204-207.

Kullback, S. (1978). Information Theory and Statistics, Peter Smith, Gloucester, Massachusetts.

Kullback, S. and Leibler, R. A. (1951). On information and sufficiency, Annals of Mathematical Statistics, 22, 79-86.

Macchiato, M., La Rotonda, L., Lapenna, V. and Ragosta, M. (1995). Time modelling and spatial clustering of daily ambient temperature: an application in Southern Italy, Environmetrics, 6, 31-53.

Maharaj, E. A. (2000). Clusters of time series, Journal of Classification, 17, 297-314.

Park, M. S. and Kim, H.-Y. (2008). Classification of precipitation data based on smoothed periodogram, The Korean Journal of Applied Statistics, 21, 547-560.

Pena, D. and Poncela, P. (2006). Nonstationary dynamic factor models, Journal of Statistical Planning and Inference, 136, 1237-1257.

Piccolo, D. (1990). A distance measure for classifying ARIMA models, Journal of Time Series Analysis, 11, 153-164.

Priestley, M. B. (1981). Spectral Analysis and Time Series, Academic Press, New York.

R Development Core Team (2006). R: A Language and Environment for Statistical Computing, Vienna, Austria: R Foundation for Statistical Computing. ISBN 3-900051-07-0.

Shumway, R. H. (2003). Time-frequency clustering and discriminant analysis, Statistics and Probability Letters, 63, 307-314.

Wismuller, A., Lange, O., Dersch, D. R., Leinsinger, G. L., Hahn, K., Putz, B. and Auer, D. (2002). Cluster analysis of biomedical image time-series, International Journal of Computer Vision, 46, 103-128.