Classification of TV Program Scenes Based on Audio Information

  • Lee, Kang-Kyu (Division of Information and Computer Science, Dankook University) ;
  • Yoon, Won-Jung (Division of Information and Computer Science, Dankook University) ;
  • Park, Kyu-Sik (Division of Information and Computer Science, Dankook University)
  • Published : 2004.09.01

Abstract

In this paper, we propose a classification system of TV program scenes based on audio information. The system classifies the video scene into six categories of commercials, basketball games, football games, news reports, weather forecasts and music videos. Two type of audio feature set are extracted from each audio frame-timbral features and coefficient domain features which result in 58-dimensional feature vector. In order to reduce the computational complexity of the system, 58-dimensional feature set is further optimized to yield l0-dimensional features through Sequential Forward Selection (SFS) method. This down-sized feature set is finally used to train and classify the given TV program scenes using κ -NN, Gaussian pattern matching algorithm. The classification result of 91.6% reported here shows the promising performance of the video scene classification based on the audio information. Finally, the system stability problem corresponding to different query length is investigated.

Keywords

References

  1. A. Yoshitaka and T. Ichikawa, 'A survey on content-based retrieval for multimedia databases,' IEEE Trans. on Knowledge and Data Engineering, 11(1), Jan. 1999
  2. H. Sundaram and S. Chang, 'Efficient video sequence retrieval in large repositories,' SPIE'99 Storage and Retrieval of Image and Video Databases VII, San Jose, CA, Jan. 1999
  3. N. Bryan-Kinns, 'A framework for video content modeling,' Multimedia tools and applications, 10, pp. 23-45, 2000 https://doi.org/10.1023/A:1009611905060
  4. H. Jiang, T. Lin and H. Zhang, 'Video segmentation with the support of audio segmentation and classification,' ICME'2000-IEEE International Conference on Multimedia and Expo, NY, USA, July 2000
  5. C. Saraceno and R. Leonardi, 'Audio as a support to scene change detection and characterization of video sequences,' Proc. Of ICASSP97, Munich, Germany, April 1997, pp. 2597-2600
  6. J. Boreczky and L. Wiicox,'A Hidden Markov Model framework for video segmentation using audio and image features,' Proc. Of ICASSP'98, pp. 3741-3744, Seattle, May 1998
  7. T. Zhang and C. Kuo, 'Video content parsing based on combined audio and visual information,' SPIE 1999, IV, pp. 78-89
  8. Z. Liu and J. Huang and Y. Wang, 'Classification of TV programs based on audio information using Hidden Markov Model', Proc. of MMSP'98, Redonda Beach, CA, pp. 27-31, Dec 1998
  9. M. Liu and C. Wan, 'A study on content-based classification retrieval of audio database,' Proc. of the International Database Engineering & Applications Symposium, pp. 339 - 345. 2001
  10. R. Duda, P. Hart and D. Stork, Pattern Classification, 2nd Ed., Wiley-Interscience Publication, 2001