The Prefix Array for Multimedia Information Retrieval in the Real-Time Stenograph

Prefix Array for Multimedia Information Retrieval in a Real-Time Stenographic Captioning Environment

  • Published : 2006.10.27

Abstract

This paper proposes an algorithm and a data structure that support real-time full-text search over streamed or broadcast multimedia data accompanied by real-time stenographic text. Traditional indexing methods in information retrieval rely on linguistic information, which makes them costly. We therefore base our approach on the suffix array, a simple data structure with low space complexity that is widely used for searching large texts. The subtitle text of multimedia data, however, grows over time, so a suffix array would have to be reconstructed each time the text changes. To address this, we propose a data structure called the prefix array, together with a search algorithm that uses it.
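The abstract does not give the construction itself, so the following is only a minimal sketch of one way such a structure could work, not the authors' algorithm. It relies on one observation: appending characters extends every suffix of the text but leaves every existing prefix unchanged. If prefixes are kept sorted by their reversed content, each new character adds exactly one entry and no existing entry has to be reordered. The class name `PrefixArray` and the methods `append` and `search` are hypothetical names chosen for this illustration.

```python
class PrefixArray:
    """Illustrative sketch (assumed design, not taken from the paper).

    Stores the end position of every prefix of the text, ordered by the
    reversed prefix. A pattern occurs in the text exactly where some
    prefix ends with it, so occurrences form one contiguous range.
    """

    def __init__(self):
        self.text = ""
        self.ends = []  # exclusive end positions, sorted by reversed prefix

    def _key(self, end, limit=None):
        # Reversed prefix ending at `end`, optionally truncated to `limit` chars.
        # Building the string costs O(n); a real index would compare in place.
        rev = self.text[:end][::-1]
        return rev if limit is None else rev[:limit]

    def append(self, chunk):
        # Each appended character creates one new prefix; old keys never change.
        for ch in chunk:
            self.text += ch
            end = len(self.text)
            key = self._key(end)
            lo, hi = 0, len(self.ends)
            while lo < hi:  # binary search for the insertion point
                mid = (lo + hi) // 2
                if self._key(self.ends[mid]) < key:
                    lo = mid + 1
                else:
                    hi = mid
            self.ends.insert(lo, end)

    def search(self, pattern):
        # Return sorted start positions of every occurrence of `pattern`.
        if not pattern:
            return []
        rp, k = pattern[::-1], len(pattern)
        lo, hi = 0, len(self.ends)
        while lo < hi:  # first prefix whose truncated key is >= rp
            mid = (lo + hi) // 2
            if self._key(self.ends[mid], k) < rp:
                lo = mid + 1
            else:
                hi = mid
        first = lo
        hi = len(self.ends)
        while lo < hi:  # first prefix whose truncated key is > rp
            mid = (lo + hi) // 2
            if self._key(self.ends[mid], k) <= rp:
                lo = mid + 1
            else:
                hi = mid
        return sorted(end - k for end in self.ends[first:lo])


index = PrefixArray()
index.append("banana")      # first batch of caption text arrives
print(index.search("ana"))
index.append("nab")         # more text arrives; nothing is rebuilt
print(index.search("ana"))
```

In this sketch, each appended character costs one binary search plus a list insertion, and a query costs two binary searches. A production index would avoid materializing reversed strings for every comparison and would likely use a structure with cheaper insertion than a Python list.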

Keywords