Advanced SearchSearch Tips
Improvement of convergence speed in FDICA algorithm with weighted inner product constraint of unmixing matrix
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
  • Journal title : Phonetics and Speech Sciences
  • Volume 7, Issue 4,  2015, pp.17-25
  • Publisher : The Korean Society of Speech Sciences
  • DOI : 10.13064/KSSS.2015.7.4.017
 Title & Authors
Improvement of convergence speed in FDICA algorithm with weighted inner product constraint of unmixing matrix
Quan, Xingri; Bae, Keunsung;
  PDF(new window)
For blind source separation of convolutive mixtures, FDICA(Frequency Domain Independent Component Analysis) algorithms are generally used. Since FDICA algorithm such as Sawada FDICA, IVA(Independent Vector Analysis) works on the frequency bin basis with a natural gradient descent method, it takes much time to converge. In this paper, we propose a new method to improve convergence speed in FDICA algorithm. The proposed method reduces the number of iteration drastically in the process of natural gradient descent method by applying a weighted inner product constraint of unmixing matrix. Experimental results have shown that the proposed method achieved large improvement of convergence speed without degrading the separation performance of the baseline algorithms.
Hadamard product form;FDICA;
 Cited by
P. Comon, (1994). Independent component analysis, a new concept?, Signal Processing, vol. 36, pp. 287-314. crossref(new window)

A. Hyvarinen, J. Karhunen, and E. Oja, (2001). Independent component analysis. John Wiley & Sons.

A. Cichocki and S. Amari, (2003). Adaptive Blind Signal and Image Processing (Learning Algorithms and Applications). New York: John Wiley.

A. J. Bell and T. J. Sejnowski, (1995). An information -maximization approach to blind separation and blind deconvolution. Neural Computation, vol. 7, pp. 1129-1159. crossref(new window)

S.-i. Amari, A. Cichocki, and H. H. Yang, (1996). A new learning algorithm for blind signal separation. Advances in neural information processing systems, pp. 757-763.

A. Hyvarinen and E. Oja, (1997). A fast fixed-point algorithm for independent component analysis. Neural Computation, vol. 9, pp. 1483-1492. crossref(new window)

H. Sawada, R. Mukai, S. Araki, and S. Makino, (2001). A polar-coordinate based activation function for frequency domain blind source separation. in Proc. Int. Conf. ICA and BSS, pp. 663-668.

Kim, T., H. T. Attias, Lee, S..-Y., and Lee, T.-W., (2007). Blind source separation exploiting higher-order frequency dependencies. Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, pp. 70-79. crossref(new window)

F. Nesta, P. Svaizer, and M. Omologo, (2011). Convolutive BSS of short mixtures by ICA recursively regularized across rrequencies. Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, pp. 624-639. crossref(new window)

K. Matsuoka, (2002). Minimal distortion principle for blind source separation. Proceedings of the 41st SICE Annual Conference, vol.4, pp. 2138-2143.

H. Sawada, S. Araki, R. Mukai, and S. Makino, (2007). Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation. Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, pp. 1592-1604. crossref(new window)

H. Sawada, S. Araki, and S. Makino, (2007) Measuring dependence of bin-wise separated signals for permutation alignment in frequency-domain BSS. in Circuits and Systems, 2007. ISCAS 2007. IEEE International Symposium on, pp. 3247-3250.

Lee, I. and Jamg, G. J., (2012). Independent vector analysis based on overlapped cliques of variable width for frequency-domain blind signal separation. EURASIP Journal on Advances in Signal Processing, vol. 2012, p. 113. crossref(new window)

Lee, I., Kim, T. and Lee, T.-W., (2007). Fast fixed-point independent vector analysis algorithms for convolutive blind source separation. Signal Processing, vol. 87, pp. 1859-1871. crossref(new window)

S. Haykin, (2009). Neural networks and learning machines vol. 3. Prentice Hall.

E. A. Lehmann. (2012). Image-source method: Matlab code implementation.

K. d. Donohue. (2009). Audio Systems Lab Experimental Data.

Quan, X. and Bae, K. (2014). Frequency bin alignment using covariance of power ratio of separated signals in multi-channel FD-ICA. Phonetics and Speech Sciences, Vol6, pp. 149-153. (전성일, 배건성, (2014). 다채널 주파수영역 독립성분분석에서 분리된 신호 전력비의 공분산을 이용한 주파수 빈 정렬. 말소리와 음성과학, vol. 6, pp. 149-153.)