DOI QR코드

DOI QR Code

Cluster Analysis of Daily Electricity Demand with t-SNE

  • Min, Yunhong (Graduate School of Logistics, Incheon National University)
  • Received : 2018.03.29
  • Accepted : 2018.05.25
  • Published : 2018.05.31

Abstract

For an efficient management of electricity market and power systems, accurate forecasts for electricity demand are essential. Since there are many factors, either known or unknown, determining the realized loads, it is difficult to forecast the demands with the past time series only. In this paper we perform a cluster analysis on electricity demand data collected from Jan. 2000 to Dec. 2017. Our purpose of clustering on electricity demand data is that each cluster is expected to consist of data whose latent variables are same or similar values. Then, if properly clustered, it is possible to develop an accurate forecasting model for each cluster separately. To validate the feasibility of this approach for building better forecasting models, we clustered data with t-SNE. To apply t-SNE to time series data effectively, we adopt the dynamic time warping as a similarity measure. From the result of experiments, we found that several clusters are well observed and each cluster can be interpreted as a mix of well-known factors such as trends, seasonality and holiday effects and other unknown factors. These findings can motivate the approaches which build forecasting models with respect to each cluster independently.

Keywords

References

  1. Korea Power Exchange, http://www.kpx.or.kr
  2. P.J. Brockwell and R.A. Davis, "Introduction to Time Series and Forecasting" Springer, 2016.
  3. D. Park, S.H. Yoon, "Clustering and classification to characterize daily electricity demand," Journal of the Korean Data & Information Science Society, Vol. 28, No. 2, pp. 395-406, March 2017. https://doi.org/10.7465/jkdi.2017.28.2.395
  4. J.H. Lim, S.Y. Kim, J.D. Park, K.B. Song, "Representative temperature assessment for improvement of short-term load forecasting accuracy," Journal of the Korean Institute of Illuminating and Electrical Installation Engineers, Vol. 27, No. 6, pp. 39-43, June 2013. https://doi.org/10.5207/JIEIE.2013.27.6.039
  5. S.H. Yoon, Y.J. Choi, "Functional clustering for electricity demand data: A case study," Journal of the Korean Data & Information Science Society, Vol. 26, No. 4, pp. 885-894, July 2015. https://doi.org/10.7465/jkdi.2015.26.4.885
  6. L.J.P. van der Maaten and G.E. Hinton, "Visualizing high-dimensional data using t-SNE," Journal of Machine Learning Research, Vol. 9, pp. 2579-2695, Nov 2008.
  7. G.E. Hinton and S.T. Roweis, "Stochastic neighbor embedding," Proceedings of Advances in Neural Information Processing Systems (NIPS), pp. 833-840, 2002.
  8. J.Kruskal and M. Liberman, "The symmetric time warping problem: From continuous to discrete," Proceedings of Time Waprs, String Edits and Macromolecules: The Theory and Practice of Sequence Comparison, pp. 125-161, 1983.
  9. S. Salvador and P. Chan, "Toward accurate dynamic time warping in linear time and space," Intelligent Data Analysis, Vol. 11, No. 5, pp. 561-580, Oct. 2007.
  10. N.V. Prasad and S. Umesh, "Improved cepstral mean and variance normalization using Bayesian framework," Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 156-161, Dec. 2013.