Advanced SearchSearch Tips
A NMF-Based Speech Enhancement Method Using a Prior Time Varying Information and Gain Function
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
A NMF-Based Speech Enhancement Method Using a Prior Time Varying Information and Gain Function
Kwon, Kisoo; Jin, Yu Gwang; Bae, Soo Hyun; Kim, Nam Soo;
  PDF(new window)
This paper presents a speech enhancement method using non-negative matrix factorization. In training phase, we can obtain each basis matrix from speech and specific noise database. After training phase, the noisy signal is separated from the speech and noise estimate using basis matrix in enhancement phase. In order to improve the performance, we model the change of encoding matrix from training phase to enhancement phase using independent Gaussian distribution models, and then use the constraint of the objective function almost same as that of the above Gaussian models. Also, we perform a smoothing operation to the encoding matrix by taking into account previous value. Last, we apply the Log-Spectral Amplitude type algorithm as gain function.
speech enhancement;NMF;Gaussian distribution model;smoothing;Log-Spectral Amplitude;
 Cited by
잡음 데이터를 활용한 음성 기저 행렬과 NMF 기반 음성 향상 기법,권기수;김형용;김남수;

한국통신학회논문지, 2015. vol.40. 4, pp.619-627 crossref(new window)
확률적 비음수 행렬 인수분해를 사용한 통계적 음성검출기법,김동국;신종원;권기수;김남수;

한국통신학회논문지, 2016. vol.41. 8, pp.851-858 crossref(new window)
Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust. Speech Signal Process., vol. 33 no. 2, pp. 443-445, Apr. 1985. crossref(new window)

I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process. vol. 81, no. 11, pp. 2403-2418, Nov. 2001. crossref(new window)

N. S. Kim and J.-H. Chang, "Spectral enhancement based on global soft decision," IEEE Signal Process. Lett. vol. 7, no. 5, pp. 108-110, May 2000. crossref(new window)

J.-H. Chang and N.S. Kim, "Noisy speech enhancement based on multiple statistical models," Telecommun. Review, vol. 16, no. 4, pp.731-747, Aug. 2006.

D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, pp. 788-791, Oct. 1999. crossref(new window)

C.-J. Lin, "Projected gradient methods for non-negative matrix factorization," Neural Computation. vol. 19, no. 10, pp. 2756-2779, Oct. 2007. crossref(new window)

R. Zdunek and A. Cichocki, "Non-negative matrix factorization with quasi-Newton optimization," in Proc. 8th Int. Conf. Artificial Intell. Soft Comput. (ICAISC 2006), pp. 870-879, Zakopane, Poland, June 2006.

A. Cichocki, R. Zdunek, and S. Amari, "New algorithms for non-negative matrix factorization in application to blind source separation," IEEE Acoust. Speech Signal Process., vol. 5, pp. 14-19, May 2006.

T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization With temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Language Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007. crossref(new window)

P. D. O'Grady and B. A. Pearlmutter, "Convolutive non-negative matrix factorization with a sparseness constraint," in Proc. 16th IEEE Signal Process. Soc. Workshop Machine Learning Signal Process., pp. 427-432, Maynooth, Ireland, Sep. 2006.

A. Pascual-Montano, J. M. Carazo, K. Kochi, D. Lehmann, and R. D. Pascual-Marqui, "Nonsmooth nonnegative matrix factorization (nsNMF)," IEEE Trans. Pattern Anal. Machine Intell., vol. 28, no. 3, pp. 403-415, Mar. 2006. crossref(new window)

P. O. Hoyer, "Non-negative sparse coding," in Proc. IEEE Workshop Neural Networks for Signal Process., pp. 557-565, Martigny, Switzerland, Sep. 2002.

D. Wang and J. Lim, "The unimportance of phase in speech enhancement," IEEE Trans. Acoust. Speech Signal Process., vol. 30, no. 4, pp. 679-681, Aug. 1982. crossref(new window)

K. W. Wilson, B. Raj, P. Smaragdis, and A. Divakaran, "Speech denoising using nonnegative matrix factorization with priors," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. 2008, pp. 4029-4032, Las Vegas, U.S.A., Apr. 2008.