Parts-Based Feature Extraction of Spectrum of Speech Signal Using Non-Negative Matrix Factorization

  • Park, Jeong-Won (Department of Electronic Engineering, Dong-A University) ;
  • Kim, Chang-Keun (Department of Electronic Engineering, Dong-A University) ;
  • Lee, Kwang-Seok (Department of Electronic Engineering, Jinju National University) ;
  • Koh, Si-Young (School of Electronic Information and Communication Engineering, Kyungil University) ;
  • Hur, Kang-In (Department of Electronic Engineering, Dong-A University)
  • Published : 2003.12.01


In this paper, we propose a new speech feature parameter obtained through parts-based feature extraction from the speech spectrum using Non-Negative Matrix Factorization (NMF). NMF can effectively reduce the dimensionality of multi-dimensional data through matrix factorization under non-negativity constraints, and the dimensionally reduced data represent parts-based features of the input data. For speech feature extraction, we applied Mel-scaled filter bank outputs as the inputs to NMF, then used the outputs of NMF as the inputs to the speech recognizer. The recognition experiments confirmed that the proposed feature parameter yields better recognition performance than the commonly used mel-frequency cepstral coefficients (MFCC).
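
The factorization step described above can be illustrated with a minimal sketch. It assumes the widely used multiplicative update rules for the squared-error objective; the abstract does not state which NMF algorithm, rank, or filter bank size the authors used, so the function name `nmf`, the rank of 4, the 24 channels, and the synthetic input matrix are all illustrative assumptions rather than the paper's configuration. The matrix `V` stands in for Mel-scaled filter bank outputs (one column per frame), the columns of `W` play the role of parts-based basis spectra, and the columns of `H` are the reduced-dimension features that would be passed to a recognizer.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (channels x frames) as W @ H.

    Uses multiplicative updates for the squared-error objective (an assumed
    choice, not confirmed by the abstract). Both factors stay non-negative
    because each update multiplies by a non-negative ratio.
    """
    rng = np.random.default_rng(seed)
    n_channels, n_frames = V.shape
    W = rng.random((n_channels, rank)) + eps  # basis vectors ("parts")
    H = rng.random((rank, n_frames)) + eps    # per-frame activations
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Synthetic stand-in for filter bank outputs: 24 channels, 100 frames.
# Real input would be non-negative Mel-scaled filter bank energies.
data_rng = np.random.default_rng(1)
V = data_rng.random((24, 4)) @ data_rng.random((4, 100))

W, H = nmf(V, rank=4)
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(f"feature matrix shape: {H.shape}, relative error: {rel_err:.4f}")
```

In this sketch each 24-dimensional frame is reduced to a 4-dimensional non-negative feature vector (a column of `H`). How a fixed basis learned from training data would be applied to unseen frames is not specified in the abstract and is left out here.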


Non-Negative Matrix Factorization;Parts-based Feature Extraction;Mel-scaled Filter Bank Output


