Search | Korea Science

Vocal Tract Normalization Using The Power Spectrum Warping (파워 스펙트럼 warping을 이용한 성도 정규화)

Yu, Il-Su;Kim, Dong-Ju;No, Yong-Wan;Hong, Gwang-Seok
- Proceedings of the KIEE Conference
- /
- 2003.11b
- /
- pp.215-218
- /
- 2003
The method of vocal tract normalization has been known as a successful method for improving the accuracy of speech recognition. A frequency warping procedure based low complexity and maximum likelihood has been generally applied for vocal tract normalization. In this paper, we propose a new power spectrum warping procedure that can be improve on vocal tract normalization performance than a frequency warping procedure. A mechanism for implementing this method can be simply achieved by modifying the power spectrum of filter bank in Mel-frequency cepstrum feature(MFCC) analysis. Experimental study compared our Proposal method with the well-known frequency warping method. The results have shown that the power spectrum warping is better 50% about the recognition performance than the frequency warping.
PDF

A New Power Spectrum Warping Approach to Speaker Warping (화자 정규화를 위한 새로운 파워 스펙트럼 Warping 방법)

유일수;김동주;노용완;홍광석
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.4
- /
- pp.103-111
- /
- 2004
The method of speaker normalization has been known as the successful method for improving the accuracy of speech recognition at speaker independent speech recognition system. A frequency warping approach is widely used method based on maximum likelihood for speaker normalization. This paper propose a new power spectrum warping approach to making improvement of speaker normalization better than a frequency warping. Th power spectrum warping uses Mel-frequency cepstrum analysis(MFCC) and is a simple mechanism to performing speaker normalization by modifying the power spectrum of Mel filter bank in MFCC. Also, this paper propose the hybrid VTN combined the Power spectrum warping and a frequency warping. Experiment of this paper did a comparative analysis about the recognition performance of the SKKU PBW DB applied each speaker normalization approach on baseline system. The experiment results have shown that a frequency warping is 2.06%, the power spectrum is 3.06%, and hybrid VTN is 4.07% word error rate reduction as of word recognition performance of baseline system.
PDF KSCI

Normalization and Search of the UV/VIS Spectra Measured from TLC/HPTLC (TLC/HPTLC에서 측정된 자외/가시부 스펙트럼의 표준화 및 검색)

Kang, Jong-Seong
- YAKHAK HOEJI
- /
- v.38 no.4
- /
- pp.366-371
- /
- 1994
To improve the identification power of TLC/HPTLC the in situ reflectance spectra obtained directly from plates with commercial scanner are used. The spectrum normalization should be carried out prior to comparing and searching the spectra from library for the identification of compounds. Because the reflectance does not obey the Lambert-Beer's law, there arise some problems in normalization. These problems could be solved to some extent by normalizing the spectra with regression methods. The spectra are manipulated with the regression function of a curve obtained from the correlation plot. When the parabola was used as the manipulating function, the spectra were identified with the accuracy of 97% and this result was better than that of conventionally used the point and area normalization method.
PDF

DEVELOPMENT OF AN AUTOMATIC PROCESSING PROGRAM FOR BOES DATA II (BOES 관측데이터의 자동처리 프로그램 개발 II)

Kang, Dong-Ii;Park, Hong-Suh;Han, In-Woo;Valyavin, G.;Lee, Byeong-Cheol;Kim, Kang-Min
- Publications of The Korean Astronomical Society
- /
- v.21 no.2
- /
- pp.101-112
- /
- 2006
We developed a new program for automatic continuum normalization of Echelle spectrographic data. Using this algorithm, we have determined spectral continuum of almost BOES data. The first advantage of this algorithm is that we can save much time for continuum determination and normalization. The second advantage is that the result of this algorithm is very reliable for almost spectral type of spectrum. But this algorithm cannot be applied directly to the spectrum which has very strong and broad emission lines, for example Wolf-Rayet type spectrum. We implanted this algorithm to the program which was developed in the previous study. And we introduced more upgraded BOES data reduction program. This program has more convenient graphical user interface environment, so users can easily reduce BOES data. Lastly, we presented the result of study on line profile variation of magnetic Ap/Bp stars analyzed using this program.
https://doi.org/10.5303/PKAS.2006.21.2.101 인용 PDF KSCI

Noise Suppression Method for Restoring Line Spectrum Pair (선스펙트럼 쌍의 복원에 의한 잡음억제 기법)

Choi, Jae-Seung
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.47 no.4
- /
- pp.112-118
- /
- 2010
This paper describes a noise suppression system based on a normalization method using a time-delay neural network and line spectrum pair having a parameter of frequency domain. First, a time-delay neural network is trained using line spectrum pair values of noisy speech signals obtained by linear prediction analysis. After trained the time-delay neural network, the proposed system enhances speech signals that are degraded by a background noise. Accordingly, the proposed time-delay neural network restores from the line spectrum pair values of noisy speech signals to the line spectrum pair values of clean speech signals. It is confirmed that this system is effective for speech signals degraded by a background noise, judging from spectral distortion measurement.
PDF KSCI

NORMALIZATION OF THE HAMILTONIAN AND THE ACTION SPECTRUM

OH YONG-GEUN
- Journal of the Korean Mathematical Society
- /
- v.42 no.1
- /
- pp.65-83
- /
- 2005
In this paper, we prove that the two well-known natural normalizations of Hamiltonian functions on the symplectic manifold ($M,\;{\omega}$) canonically relate the action spectra of different normalized Hamiltonians on arbitrary symplectic manifolds ($M,\;{\omega}$). The natural classes of normalized Hamiltonians consist of those whose mean value is zero for the closed manifold, and those which are compactly supported in IntM for the open manifold. We also study the effect of the action spectrum under the ${\pi}_1$ of Hamiltonian diffeomorphism group. This forms a foundational basis for our study of spectral invariants of the Hamiltonian diffeomorphism in [8].
https://doi.org/10.4134/JKMS.2005.42.1.065 인용 PDF KSCI

A Robust Watermarking Method against Partial Damage and Geometric Attack (부분 손상과 기하학적 공격에 강인한 워터마킹 방법)

Kim, Hak-Soo
- Journal of Korea Multimedia Society
- /
- v.15 no.9
- /
- pp.1102-1111
- /
- 2012
In this paper, we propose a robust watermarking method against geometric attack even though the watermarked image is partially damaged. This method consists of standard image normalization which transforms any image into a predefined standard image and embedding watermark in DCT domain of standard normalized image using spread spectrum technique. The proposed standard image normalization method has an improvement over existing image normalization method, so it is robust to partial damage and geometric attack. The watermark embedding method using spread spectrum technique also has a robustness to image losses such as blurring, sharpening and compressions. In addition, the proposed watermarking method does not need an original image to detect watermark, so it is useful to public watermarking applications. Several experimental results show that the proposed watermarking method is robust to partial damage and various attacks including geometric deformation.
https://doi.org/10.9717/kmms.2012.15.9.1102 인용 PDF KSCI

A Frequency-Domain Normalized MBD Algorithm with Unidirectional Filters for Blind Speech Separation

Kim Hye-Jin;Nam Seung-Hyon
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.2E
- /
- pp.54-60
- /
- 2005
A new multichannel blind deconvolution algorithm is proposed for speech mixtures. It employs unidirectional filters and normalization of gradient terms in the frequency domain. The proposed algorithm is shown to be approximately nonholonomic. Thus it provides improved convergence and separation performances without whitening effect for nonstationary sources such as speech and audio signals. Simulations using real world recordings confirm superior performances over existing algorithms and its usefulness for real applications.
PDF KSCI

Comparison of the Dynamic Time Warping Algorithm for Spoken Korean Isolated Digits Recognition (한국어 단독 숫자음 인식을 위한 DTW 알고리즘의 비교)

홍진우;김순협
- The Journal of the Acoustical Society of Korea
- /
- v.3 no.1
- /
- pp.25-35
- /
- 1984
This paper analysis the Dynamic Time Warping algorithms for time normalization of speech pattern and discusses the Dynamic Programming algorithm for spoken Korean isolated digits recognition. In the DP matching, feature vectors of the reference and test pattern are consisted of first three formant frequencies extracted by power spectrum density estimation algorithm of the ARMA model. The major differences in the various DTW algorithms include the global path constrains, the local continuity constraints on the path, and the distance weighting/normalization used to give the overall minimum distance. The performance criterias to evaluate these DP algorithms are memory requirement, speed of implementation, and recognition accuracy.
PDF

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- Speech Sciences
- /
- v.11 no.1
- /
- pp.7-20
- /
- 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.
PDF

Search Result 55, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)