DOI QR코드

DOI QR Code

다중속성 시계열 데이타베이스의 효율적인 유사 검색

Efficient Similarity Search in Multi-attribute Time Series Databases

  • 발행 : 2007.12.31

초록

시계열에 대한 색인 및 검색 연구는 하나의 속성으로 구성된 시계열에 대하여 주로 수행되어 왔다. 그러나 음악, 비디오 등의 멀티미디어 데이타베이스는 다중속성 시계열 데이타베이스에서 유사 검색을 다룰 수 있어야 한다. 기존의 다중속성 시계열 데이타베이스에 대한 연구는 두 다중속성 시퀀스간의 유사도로 속성 간의 거리의 누적을 사용하고 있기에, 개별적인 속성 시퀀스에 대한 정보를 상실하게 된다. 본 연구에서는 이러한 문제를 해결하기 위해 속성 시퀀스 측면에서 다중속성 시계열 데이타베이스의 유사검색 기법을 제안한다. 제안된 기법은 검색 공간을 효율적으로 줄일 수 있으며, 착오 누락이 없음을 보장한다. 또한 실험을 통해 제안된 기법의 성능 향상을 확인하였다.

Most of previous work on indexing and searching time series focused on the similarity matching and retrieval of one-attribute time series. However, multimedia databases such as music, video need to handle the similarity search in multi-attribute time series. The limitation of the current similarity models for multi-attribute sequences is that there is no consideration for attributes' sequences. The multi-attribute sequences are composed of several attributes' sequences. Since the users may want to find the similar patterns considering attributes's sequences, it is more appropriate to consider the similarity between two multi-attribute sequences in the viewpoint of attributes' sequences. In this paper, we propose the similarity search method based on attributes's sequences in multi-attribute time series databases. The proposed method can efficiently reduce the search space and guarantees no false dismissals. In addition, we give preliminary experimental results to show the effectiveness of the proposed method.

키워드

참고문헌

  1. Rakesh Agrawal, Christos Faloutsos and Arun N. Swami, 'Efficient Similarity Search in Sequence Databases,' Proceedings of the International Conference of Foundations of Data Organization and Algorithms, pp.69-84, 1993
  2. Antonin Guttman, 'R-trees: A Dynamic Index Structure for Spatial Searching,' Proceedings of ACM SIGMOD International Conference on Management of Data, pp.47-57, 1984
  3. Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider and Bernhard Seeger, 'The R*-tree: An Efficient and Robust Access Method for Points and Rectangles,' Proceedings of ACM SIGMOD International Conference on Management of Data, pp.322-331, 1990 https://doi.org/10.1145/93605.98741
  4. Byoung-Kee Yi, Christos Faloutsos, 'Fast Time Sequence Indexing for Arbitrary Lp Norms,' Proceedings of International Conference on Very Large Data Bases, pp. 385-394, 2000
  5. Sangjun Lee, Bumsoo Kim, Sukho Lee, 'Efficient Range Search Method for Multi-dimensional Sequence Databases,' KISS Journal, Vol. 26(5), pp.613-620, 1999
  6. Davood Rafiei, Alberto O. Mendelzon, 'Similarity-Based Queries for Time Series Data,' Proceedings of ACM SIGMOD International Conference on Management of Datae, pp.13-25, 1997 https://doi.org/10.1145/253260.253264
  7. Kin-pong Chan, Ada Wai-chee Fu, 'Efficient Time Series Matching by Wavelets,' Proceedings of International Conference on Data Engineering, pp.126-133, 1999 https://doi.org/10.1109/ICDE.1999.754915
  8. Flip Korn, H. V. Jagadish, Christos Faloutsos, 'Efficiently Supporting Ad Hoc Queries in Large Datasets of Time Sequences,' Proceedings of ACM SIGMOD International Conference on Management of Data, pp.289-300, 1997 https://doi.org/10.1145/253260.253332
  9. Eamonn J. Keogh, Kaushik Chakrabarti, Sharad Mehrotra, Michael J. Pazzani, 'Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases,' Proceedings of ACM SIGMOD International Conference on Management of Data, pp.151-162, 2001 https://doi.org/10.1145/376284.375680
  10. Byoung-Kee Yi, H. V. Jagadish, Christos Faloutsos, 'Efficient Retrieval of Similar Time Sequences Under Time Warping,' Proceedings of International Conference on Data Engineering, pp.201-208, 1998 https://doi.org/10.1109/ICDE.1998.655778
  11. Sangwook Kim, Sanghyun Park and W. Chu, 'An Index-based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases,' Proceedings of International Conference on Data Engineering, pp.607-614, 2001 https://doi.org/10.1109/ICDE.2001.914875
  12. Eamonn J. Keogh, 'Exact Indexing of Dynamic Time Warping,' Proceedings of International Conference on Very Large Data Bases, pp.406-417, 2002
  13. Sanghyun Park, Wesley W. Chu, Jeehee Yoon, Chihcheng Hsu, 'Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases,' Proceedings of International Conference on Data Engineering, pp.23-32, 2000
  14. Seok-Lyong Lee, Seok-Ju Chun, Deok-Hwan Kim, Ju-Hong Lee and Chin-Wan Chung, 'Similarity Search for Multidimensional Data Sequences,' Proceedings of International Conference on Data Engineering, pp.599-608, 2000 https://doi.org/10.1109/ICDE.2000.839473
  15. Michail Vlachos, G.Kollios and Dimitrios Gunopulos, 'Discovering Similar Multidimensional Trajectories,' Proceedings of International Conference on Data Engineering, pp.673-684, 2002 https://doi.org/10.1109/ICDE.2002.994784
  16. Tamer Kahveci, Ambuj Singh and Aliekber Gurel, 'Similarity Searching for Multi-attribute Sequences,' Proceedings of International Conference on Scientific and Statistical Database Management, pp.175-184, 2002 https://doi.org/10.1109/SSDM.2002.1029718
  17. Joseph M. Hellerstein, Elias Koutsoupias, and Christos H. Papadimitriou, 'On the Analysis of Indexing Schemes,' Proceedings of ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp.249-256, 1997
  18. Ada Wai-Chee Fu, Eamonn J. Keogh, Leo Yung Hang Lau, Chotirat (Ann) Ratanamahatana, 'Scaling and Time Warping in Time Series Querying,' Proceedings of International Conference on Very Large Data Bases, pp.649-660, 2005
  19. Eamonn J. Keogh, Li Wei, Xiaopeng Xi, Sang-Hee Lee, Michail Vlachos, 'LB_Keogh Supports Exact Indexing of Shapes under Rotation Invariance with Arbitrary Representations and Distance Measures,' Proceedings of International Conference on Very Large Data Bases, pp.882-893, 2006
  20. Sang-Wook Kim, Dae-Hyun Park, Heon-Gil Lee, 'Efficient Processing of Subsequence Matching with the Euclidean Metric in Time-series Databases,' Information Process. Letters, Vol. 90(5), pp.253-260, 2004 https://doi.org/10.1016/j.ipl.2004.02.014
  21. Yang-Sae Moon and Jinho Kim, 'A Single Index Approach for Time-Series Subsequence Matching that Supports Moving Average Transform of Arbitrary Order,' Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.739-749, 2006 https://doi.org/10.1007/11731139_86