• Title/Summary/Keyword: pitch

Search Result 4,217, Processing Time 0.028 seconds

Automatic Recognition of Pitch Accents Using Time-Delay Recurrent Neural Network (시간지연 회귀 신경회로망을 이용한 피치 악센트 인식)

  • Kim, Sung-Suk;Kim, Chul;Lee, Wan-Joo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.4E
    • /
    • pp.112-119
    • /
    • 2004
  • This paper presents a method for the automatic recognition of pitch accents with no prior knowledge about the phonetic content of the signal (no knowledge of word or phoneme boundaries or of phoneme labels). The recognition algorithm used in this paper is a time-delay recurrent neural network (TDRNN). A TDRNN is a neural network classier with two different representations of dynamic context: delayed input nodes allow the representation of an explicit trajectory F0(t), while recurrent nodes provide long-term context information that can be used to normalize the input F0 trajectory. Performance of the TDRNN is compared to the performance of a MLP (multi-layer perceptron) and an HMM (Hidden Markov Model) on the same task. The TDRNN shows the correct recognition of $91.9{\%}\;of\;pitch\;events\;and\;91.0{\%}$ of pitch non-events, for an average accuracy of $91.5{\%}$ over both pitch events and non-events. The MLP with contextual input exhibits $85.8{\%},\;85.5{\%},\;and\;85.6{\%}$ recognition accuracy respectively, while the HMM shows the correct recognition of $36.8{\%}\;of\;pitch\;events\;and\;87.3{\%}$ of pitch non-events, for an average accuracy of $62.2{\%}$ over both pitch events and non-events. These results suggest that the TDRNN architecture is useful for the automatic recognition of pitch accents.

Preparation of pitch-coated $TiO_2$ and their photocatalytic performance

  • Chen, Ming-Liang;Oh, Won-Chun
    • Journal of the Korean Crystal Growth and Crystal Technology
    • /
    • v.17 no.1
    • /
    • pp.23-29
    • /
    • 2007
  • Pitch-coated anatase $TiO_2$ typed was prepared by $CCl_4$ solvent mixing method with different mixing ratios. Since the carbon layers derived from pitch on the $TiO_2$ particles were porous, the pitch-coated $TiO_2$ sample series showed a good adsorptivity and photo decomposition activity. The BET surface area for the carbon layer in the sample increases to increasing with pitch contents. The SEM results present to the characterization of porous texture on the pitch-coated $TiO_2$ sample and pitch distributions on the surfaces for all the materials used. From XRD data a weak and broad carbon peak of graphene with pristine anatase peaks were observed in the X-ray diffraction patterns for the pitch-coated $TiO_2$. The EDX spectra show the presence of C, O and S with strong Ti peaks. Most of these samples are richer in carbon and major Ti metal than any other elements. Finally, the excellent photocatalytic activity of pitch-coated $TiO_2$ with slope relationship between relative concentration of MB ($c/c_o$) and t could be attributed to the homogeneous coated pitch on the external surface by $CCl_4$ solvent method.

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

NUMERICAL ANALYSIS FOR LONGITUDINAL PITCH EFFECT ON TUBE BANK HEAT TRANSFER (관군 배열에서의 종간 간격이 열전달에 미치는 영향에 대한 수치 해석적 연구)

  • Lee, D.;Ahn, J.;Shin, S.
    • Journal of computational fluids engineering
    • /
    • v.17 no.3
    • /
    • pp.39-44
    • /
    • 2012
  • In this study, a longitudinal pitch effect on in-line tube bank heat transfer has been analyzed numerically. To verify the accuracy of the solver model and boundary conditions, global Nusselt number(Nu) and pressure drop across the 2 row tube bank are compared with the existing experimental correlations under 500 ~ 2,000 Reynolds number(Re) range. By changing transverse pitch($S_T$) or longitudinal pitch($S_L$) separately in tube bank, we're trying to identify the each effect on heat transfer. We found that the effect of transverse pitch can be accounted for Reynolds number evaluated with maximum velocity($V_{max}$) at the smallest flow area similar to most existing correlations. Variation of the longitudinal pitch($S_L$) has a greater impact on the heat transfer compared to the transverse pitch($S_T$). Overall Nusselt number increases with larger longitudinal pitch($S_L$), however individual Nusselt number of the tube row has significant difference after the first row.

An Experimental Study on Selection Pitch Angle on backward flow of an Axial Fan with Adjustable Pitch Angle Blades (피치각 조정형 송풍-역풍 겸용 축류팬에서 배연용 피치각 선정을 위한 실험적 연구)

  • Chang, Taek-Soon;Hur, Jin-Huek;Moon, Seung-Jae;Lee, Jae-Heon;You, Ho-Sun;Im, Yun-Chul
    • Proceedings of the SAREK Conference
    • /
    • 2008.11a
    • /
    • pp.145-150
    • /
    • 2008
  • In this study, the experimental study has carried out to select pitch angle on the backward flow in an axial fan that has adjustable pitch blades. With the change of pitch angle of axial fan with adjustable blade, air flow rate, pressure and air flow direction can be changed. Because of this merit, adjustable axial fan can be used in the backward flow. For the selection of the backward flow pitch angle, fan performance test method is selected by KS B 6311. Dynamic pressure, static pressure, electric current and voltage are measured in each pitch angles of axial fan that are $36^{\circ}C$, $-16^{\circ}C$, $-21^{\circ}C$, $-26^{\circ}C$, $-31^{\circ}C$ and $-36^{\circ}C$. In the result of test, fan performance curves at several pitch angle has been investigated. Finally, pitch angle of $-26^{\circ}C$ has been selected to get largest flow rate at backward flow situation.

  • PDF

On a Detection for the Fundamental Frequency of Speech Signals (음성신호의기본주파수 검출)

  • 배명진
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.42-47
    • /
    • 1994
  • A pitch detector is an essential component in a variety of speech processing systems. Besides providing valuable insights into the nature of the exciation source for speech production, the pitch contour of an utterance is useful for recognizing speakers, aids-to-the handicapped, and is required in almost all speech analysis-synthesis system. Because of the importance of the pitch detection, a wide variety algorithms for pitch detection have been proposed in speech procesing literature. Thus, in this paper we discuss th evarious type of pitch detection algorithms which have been proposed until now. Then we provide th eperformance measurements for seven pitch detection algorithms.

  • PDF

Modification of Pitch Algorithm and Its Application to Noise (피치 알고리즘 수정 및 소음에의 적용)

  • Shin, Sung-Hwan;Ih, Jeong-Guon
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2002.11a
    • /
    • pp.354.1-354
    • /
    • 2002
  • Pitch is a perception related to frequency, one of the psychological aspects or attributes of tones, and an important factor to determine sound quality of sound together with loudness and timber. while a study on pitch has been actively achieved In the part of speech recognition and speech separation, that for analysis and improvement of product sound quality is not yet enough. (omitted)

  • PDF

An Algorithm to Reduce the Pitch Computational amount using Modified Delta Searching in CELP Vocoders (CELP 보코더에서 델타 피치 검색 방법 개선에 대한 연구)

  • Ju, Sang-Gyu
    • Proceedings of the KAIS Fall Conference
    • /
    • 2010.05a
    • /
    • pp.269-272
    • /
    • 2010
  • In this paper, we propose the computation reduction methods of delta pitch search that is used in G.723.1 vocoder. In order to decrease the computational amount in delta pitch search the characteristic of proposed algorithms is as the following. First, scheme to reduce the computation amount in delta pitch search uses NAMDF. Developed the second scheme is the skipping technique of lags in pitch searching by using the threshold value. By doing so, we can reduce the computational amount of pitch searching more than 64% with negligible quality degradation.

  • PDF

On a Performance Comparison of Pitch Search Algorithms with the Correlation Properties for the CELP Vocoder (상관관계 특성을 이용한 CELP 보코더의 피치검색시간 단축법의 비교)

  • 김대식
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.188-194
    • /
    • 1994
  • Code excited linear prediction speech coders exhibit good performance at data rates as low as 4800bps. But the major drawback to CELP type coders is their large computational requirements. Therefore, in this paper a comparative performance study of three pitch searching algorithms for the CELP vocoder was conducted. For each of the algorithms, a standard pitch searching algorithm was used by the full pitch searching algorithm that was implimented in the QCELP vocoder. The algorithms used in this study is to reduce the pitch searching time 1) using the skip table, 2) using the symmetrical property of the autocorrelation , and 3) using the preprocessing autocorrelation, 4) using the positive autocorrelation, 5) using the preliminary pitch. Performance scores are presented for each of the five pitch searching algorithms based on computation speed and on pitch prediction error.

  • PDF

Statistical Approaches to Convert Pitch Contour Based on Korean Prosodic Phrases (한국어 운율구 기반의 피치궤적 변환의 통계적 접근)

  • Lee, Ki-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1E
    • /
    • pp.10-15
    • /
    • 2004
  • In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speakers utterance be converted into that of the target speaker, because pitch contour of a speech utterance plays an important role in expressing speaker's individuality and meaning of the utterance. This paper describes statistical algorithms of pitch contour conversion for Korean language. Pitch contour conversions are investigated at two 1 evels of prosodic phrases: intonational phrase and accentual phrase. The basic algorithm is a Gaussian normalization [7] in intonational phrase. The first presented algorithm is combined with a declination-line of pitch contour in an intonational phrase. The second one is Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms in intonational phrase.