Classification of TV Program Scenes Based on Audio Information

Lee, Kang-Kyu; Yoon, Won-Jung; Park, Kyu-Sik

The Journal of the Acoustical Society of Korea

Volume 23, Issue 3E / Pages 91-97 / 2004 / pISSN 1225-4428

The Acoustical Society of Korea (한국음향학회)

Lee, Kang-Kyu (Division of Information and Computer Science, Dankook University) ;
Yoon, Won-Jung (Division of Information and Computer Science, Dankook University) ;
Park, Kyu-Sik (Division of Information and Computer Science, Dankook University)

Published: 2004.09.01

Abstract

In this paper, we propose a system for classifying TV program scenes based on audio information. The system classifies each video scene into one of six categories: commercials, basketball games, football games, news reports, weather forecasts, and music videos. Two types of audio features are extracted from each audio frame, timbral features and coefficient-domain features, which together form a 58-dimensional feature vector. To reduce the computational complexity of the system, the 58-dimensional feature set is reduced to 10 dimensions using the Sequential Forward Selection (SFS) method. This reduced feature set is then used to train the classifiers and to classify the given TV program scenes using k-NN and Gaussian pattern matching algorithms. The classification accuracy of 91.6% reported here shows the promise of video scene classification based on audio information. Finally, the stability of the system with respect to query length is investigated.
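The feature-reduction step can be illustrated with a minimal sketch of Sequential Forward Selection wrapped around a k-NN classifier. Everything here is an assumption made for illustration: the abstract does not state the selection criterion, the value of k, or the data, so this sketch uses leave-one-out k-NN accuracy as the criterion, k = 3, and a small synthetic two-class data set in place of the paper's 58-dimensional audio features. The function names (knn_predict, loo_accuracy, sequential_forward_selection, make_toy_data) are hypothetical and not taken from the paper.

```python
import random
from collections import Counter


def knn_predict(train_X, train_y, x, features, k=3):
    """Label x by majority vote among its k nearest training vectors,
    using squared Euclidean distance on the selected features only."""
    dists = sorted(
        (sum((row[f] - x[f]) ** 2 for f in features), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def loo_accuracy(X, y, features, k=3):
    """Leave-one-out accuracy of k-NN restricted to `features`
    (an assumed selection criterion, not the paper's stated one)."""
    hits = 0
    for i in range(len(X)):
        rest_X = X[:i] + X[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        hits += knn_predict(rest_X, rest_y, X[i], features, k) == y[i]
    return hits / len(X)


def sequential_forward_selection(X, y, n_select, k=3):
    """Greedy SFS: start from an empty set and repeatedly add the single
    feature that gives the highest criterion value with those already chosen."""
    selected = []
    remaining = list(range(len(X[0])))
    while len(selected) < n_select:
        best = max(remaining,
                   key=lambda f: loo_accuracy(X, y, selected + [f], k))
        selected.append(best)
        remaining.remove(best)
    return selected


def make_toy_data(n_per_class=20, n_features=6, seed=0):
    """Synthetic stand-in for audio feature vectors (hypothetical data)."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
            row[2] += 8.0 * label  # only feature 2 carries class information
            X.append(row)
            y.append(label)
    return X, y


X, y = make_toy_data()
chosen = sequential_forward_selection(X, y, n_select=2)
accuracy = loo_accuracy(X, y, chosen)
print("selected feature indices:", chosen)
print("leave-one-out accuracy:", accuracy)
```

In the setting described in the abstract, the same greedy loop would run over 58 candidate features and stop at 10. Because each step re-evaluates the classifier for every remaining feature, the wrapper approach is costly during selection but leaves a much cheaper classifier at query time.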
