• Title/Summary/Keyword: Frequency warping

Search Result 55, Processing Time 0.021 seconds

A New Power Spectrum Warping Approach to Speaker Warping (화자 정규화를 위한 새로운 파워 스펙트럼 Warping 방법)

  • 유일수;김동주;노용완;홍광석
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.4
    • /
    • pp.103-111
    • /
    • 2004
  • The method of speaker normalization has been known as the successful method for improving the accuracy of speech recognition at speaker independent speech recognition system. A frequency warping approach is widely used method based on maximum likelihood for speaker normalization. This paper propose a new power spectrum warping approach to making improvement of speaker normalization better than a frequency warping. Th power spectrum warping uses Mel-frequency cepstrum analysis(MFCC) and is a simple mechanism to performing speaker normalization by modifying the power spectrum of Mel filter bank in MFCC. Also, this paper propose the hybrid VTN combined the Power spectrum warping and a frequency warping. Experiment of this paper did a comparative analysis about the recognition performance of the SKKU PBW DB applied each speaker normalization approach on baseline system. The experiment results have shown that a frequency warping is 2.06%, the power spectrum is 3.06%, and hybrid VTN is 4.07% word error rate reduction as of word recognition performance of baseline system.

Vocal Tract Normalization Using The Power Spectrum Warping (파워 스펙트럼 warping을 이용한 성도 정규화)

  • Yu, Il-Su;Kim, Dong-Ju;No, Yong-Wan;Hong, Gwang-Seok
    • Proceedings of the KIEE Conference
    • /
    • 2003.11b
    • /
    • pp.215-218
    • /
    • 2003
  • The method of vocal tract normalization has been known as a successful method for improving the accuracy of speech recognition. A frequency warping procedure based low complexity and maximum likelihood has been generally applied for vocal tract normalization. In this paper, we propose a new power spectrum warping procedure that can be improve on vocal tract normalization performance than a frequency warping procedure. A mechanism for implementing this method can be simply achieved by modifying the power spectrum of filter bank in Mel-frequency cepstrum feature(MFCC) analysis. Experimental study compared our Proposal method with the well-known frequency warping method. The results have shown that the power spectrum warping is better 50% about the recognition performance than the frequency warping.

  • PDF

Bilingual Voice Conversion Using Frequency Warping on Formant Space (포만트 공간에서의 주파수 변환을 이용한 이중 언어 음성 변환 연구)

  • Chae, Yi-Geun;Yun, Young-Sun;Jung, Jin Man;Eun, Seongbae
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.133-139
    • /
    • 2014
  • This paper describes several approaches to transform a speaker's individuality to another's individuality using frequency warping between bilingual formant frequencies on different language environments. The proposed methods are simple and intuitive voice conversion algorithms that do not use training data between different languages. The approaches find the warping function from source speaker's frequency to target speaker's frequency on formant space. The formant space comprises four representative monophthongs for each language. The warping functions can be represented by piecewise linear equations, inverse matrix. The used features are pure frequency components including magnitudes, phases, and line spectral frequencies (LSF). The experiments show that the LSF-based voice conversion methods give better performance than other methods.

An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition (음성인식에서 화자 내 정규화를 위한 진폭 변경 방법)

  • Kim Dong-Hyun;Hong Kwang-Seok
    • Journal of Internet Computing and Services
    • /
    • v.4 no.3
    • /
    • pp.9-14
    • /
    • 2003
  • The method of vocal tract normalization is a successful method for improving the accuracy of inter-speaker normalization. In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untransformed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. The variation of utterance is two types: frequency and amplitude variation. The vocal tract normalization is frequency normalization among inter-speaker normalization methods. Therefore, we have to consider amplitude variation, and it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. k, the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.

  • PDF

Dynamic Response Analysis of Open Section Structures with Warping Restraint Conditions and Impact Load Durations

  • Chun, Dong-Joon
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.2
    • /
    • pp.159-164
    • /
    • 2020
  • The response analysis of frame structure with open section beams considering warping conditions and short duration load have been performed. When a beam of frame structure is subjected under torsional moment, the cross section will deform a warping as well as twist. For some thin-walled sections warping will be large, and accompanying warping restraint will induce axial and shear stresses and reduce the twist of beam which stiffens the beam in torsion. Because of impact or blast loads, the wave propagation effects become increasingly important as load duration decreases. This paper presents that a warping restraint in finite element model effects the behavior of beam deformation, dynamic mode shape and response analysis. The computer modelling of frame is discussed in linear beam element model and linear thin shell element model, also presents a correlation between computer predicted and actual experimental results for static deflection, natural frequencies and mode shapes of frame. A method to estimate the number of normal modes that are important is discussed.

An Efficient Crosstalk Cancellation Algorithm Using Pole-zero Dewarping (Pole-zero Dewarping을 이용한 효율적인 Crosstalk 제거 알고리듬)

  • Lee Junho;Park Young-cheol;Youn Dae-hee;Jeong Jae-woong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.3
    • /
    • pp.133-140
    • /
    • 2005
  • Crosstalk canceller in stereo channel audio reproduction system has the purpose to deliver desired signals exactly at the listener's ear. Generally. it has a Poor performance in low frequency bands. Frequency-warped Otters are used to provide improved performance in crosstalk canceller for these problems. However. such filters are more complex to implement than conventional filters. This paper presents an efficient method for low-order IIR approximation of frequency warped crosstalk cancellation filters using Pole-zero dewarping. The method preserves the advantages of frequency warping, but has a computational complexity that is similar to the conventional method. This Paper also presents a series of experiments that validate the method of crosstalk canceller.

Generalization of the Spreading Function and Weyl Symbol for Time-Frequency Analysis of Linear Time-Varying Systems

  • Iem, Byeong-gwan
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.7
    • /
    • pp.628-632
    • /
    • 2001
  • We propose time-frequency (TF) tools for analyzing linear time-varying (LTV) systems and nonstationary random processes. Obtained warping the narrowband Weyl symbol (WS) and spreading function (SF), the new TF tools are useful for analyzing LTV systems and random processes characterized by generalized frequency shifts, This new Weyl symbol (WS) is useful in wideband signal analysis. We also propose WS an tools for analyzing systems which produce dispersive frequency shifts on the signal. We obtain these generalized, frequency-shift covariant WS by warping conventional, narrowband WS. Using the new, generalized WS, we provide a formulation for the Weyl correspondence for linear systems with instantaneous of linear signal transformation as weighted superpositions of non-linear frequency shifts on the signal. Application examples in signal and detection demonstrate the advantages of our new results.

  • PDF

Vocal Tract Length Normalization for Speech Recognition (음성인식을 위한 성도 길이 정규화)

  • 지상문
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.7
    • /
    • pp.1380-1386
    • /
    • 2003
  • Speech recognition performance is degraded by the variation in vocal tract length among speakers. In this paper, we have used a vocal tract length normalization method wherein the frequency axis of the short-time spectrum associated with a speaker's speech is scaled to minimize the effects of speaker's vocal tract length on the speech recognition performance In order to normalize vocal tract length, we tried several frequency warping functions such as linear and piece-wise linear function. Variable interval piece-wise linear warping function is proposed to effectively model the variation of frequency axis scale due to the large variation of vocal tract length. Experimental results on TIDIGITS connected digits showed the dramatic reduction of word error rates from 2.15% to 0.53% by the proposed vocal tract normalization.

Finding the optimal frequency for trade and development of system trading strategies in futures market using dynamic time warping (선물시장의 시스템트레이딩에서 동적시간와핑 알고리즘을 이용한 최적매매빈도의 탐색 및 거래전략의 개발)

  • Lee, Suk-Jun;Oh, Kyong-Joo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.2
    • /
    • pp.255-267
    • /
    • 2011
  • The aim of this study is to utilize system trading for making investment decisions and use technical analysis and Dynamic Time Warping (DTW) to determine similar patterns in the frequency of stock data and ascertain the optimal timing for trade. The study will examine some of the most common patterns in the futures market and use DTW in terms of their frequency (10, 30, 60 minutes, and daily) to discover similar patterns. The recognized similar patterns were verified by executing trade simulation after applying specific strategies to the technical indicators. The most profitable strategies among the set of strategies applied to common patterns were again applied to the similar patterns and the results from DTW pattern recognition were examined. The outcome produced useful information on determining the optimal timing for trade by using DTW pattern recognition through system trading, and by applying distinct strategies depending on data frequency.

Vibration Characteristics of Thin-Walled Beams (두께가 얇은 단면을 갖는 보의 진동특성)

  • Oh, Sang-Jin;Lee, Jae-Young;Mo, Jeong-Man;Park, Kwang-Kyou
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2004.11a
    • /
    • pp.709-712
    • /
    • 2004
  • A study of the coupled flexural-torsional vibrations of thin-walled beams with monosymmetric cross-section is presented. The governing differential equations for free vibration of such beams are solved numerically to obtain natural frequencies and their corresponding mode shapes. The beam model is based on the Bernoulli-Euler beam theory and the effect of warping is taken into consideration. Numerical results are given for two specific examples of beams with free-free, clamped-free, hinged-hinged, clamped-hinged and clamped-clamped end constraints both including and excluding the effect of warping stiffness. The effect of warping stiffness on the natural frequencies and mode shapes is discussed and it is concluded that substantial error can be incurred if the effect is ignored.

  • PDF