Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
The Journal of the Acoustical Society of Korea
Journal Basic Information
Journal DOI :
The Acoustical Society of Korea
Editor in Chief :
Volume & Issues
Volume 18, Issue 8 - Nov 1999
Volume 18, Issue 7 - Oct 1999
Volume 18, Issue 6 - Aug 1999
Volume 18, Issue 5 - Jul 1999
Volume 18, Issue 4 - May 1999
Volume 18, Issue 3 - Apr 1999
Volume 18, Issue 2 - Feb 1999
Volume 18, Issue 1 - Jan 1999
Volume 18, Issue 4E - 00 1999
Volume 18, Issue 3E - 00 1999
Volume 18, Issue 2E - 00 1999
Volume 18, Issue 1E - 00 1999
Selecting the target year
Acoustic Model Improvement and Performance Evaluation of the Variable Vocabulary Speech Recognition System
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 3~8
Previous variable vocabulary speech recognition systems with context-independent acoustic modeling, could not represent the effect of neighboring phonemes. To solve this problem, we use allophone-based context-dependent acoustic model. This paper describes the method to improve acoustic model of the system effectively. Acoustic model is improved by using allophone clustering technique that uses entropy as a similarity measure and the optimal allophone model is generated by changing the number of allophones. We evaluate performance of the improved system by using Phonetically Optimized Words(POW) DB and PC commands(PC) DB. As a result, the allophone model composed of six hundreds allophones improved the recognition rate by 13% from the original context independent model m POW test DB.
The Continuous Speech Recognition with Prosodic Phrase Unit
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 9~16
Generally, a speaker structures utterances very clearly by grouping words into phrases. This facilitates the listener's recovery of the meaning of the utterance and the speaker's intention. To this purpose, a speaker uses, among other things, prosodic information such as intonation pause, duration, intensity, etc. The research described here is concerned with the relationship between the strength of prosodic boundaries in spoken utterances as perceived by untrained listeners(Perceptual boundary strength, PBS)-In this paper, the preceptual boundary strength is used as the same meaning of the prosodic boundary strength-and prosodic information. We made a rule determinating the prosodic boundaries and verified the usefulness of the prosodic phrase as a recognition unit. Experiments results showed that the performance of speech recognition(SR) is improved in aspect of recognition rate and time compared with that using sentences as recognition unit. In the future we will suggest the methods that estimate more appropriate boundaries and study more various methods of prosody assisted SR.
The Development of Ultrasonic Pulsed Doppler for the Measurement of Velocity Distribution of Underwater Substances
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 17~24
Design and Fabrication of Traveling Wave Type Bi-directional Ultrasonic Linear Motors
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 25~31
This paper describes design and fabrication of a new bi-directional ultrasonic linear motor working by means of a traveling wave. With the finite element method, we design and verify validity of the new structure, and determine its optimal structure, material, size, and boundary conditions for proper generation of the traveling wave. Based on the results, a prototype of the motor has been fabricated and characterized, which thereby proves practical applicability of the new structure.
Inversion of Acoustical Properties of Sedimentary Layers from Chirp Sonar Signals
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 32~41
In this paper, an inversion method using chirp signals and two near field receivers is proposed. Inversion problems can be formulated into the probabilistic models composed of signals, a forward model and noise. Forward model to simulate chirp signals is chosen to be the source-wavelet-convolution planewave modeling method. The solution of the inversion problem is defined by a posteriori pdf. The wavelet matching technique, using weighted least-squares fitting, estimates the sediment sound-speed and thickness on which determination of the ranges for a priori uniform distribution is based. The genetic algorithm can be applied to a global optimization problem to find a maximum a posteriori solution for determined a priori search space. Here the object function is defined by an L₂norm of the difference between measured and modeled signals. The observed signals can be separated into a set of two signals reflected from the upper and lower boundaries of a sediment. The separation of signals and successive applications of the genetic algorithm optimization process reduce the search space, therefore improving the inversion results. Not only the marginal pdf but also the statistics are calculated by numerical evaluation of integrals using the samples selected during importance sampling process of the genetic algorithm. The examples applied here show that, for synthetic data with noise, it is possible to carry out an inversion for sedimentary layers using the proposed inversion method.
Efficient Mixture IMM Algorithm for Speech Enhancement under Nonstationary Additive Colored Noise
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 42~47
In this paper, a mixture interacting multiple model (MIMM) algorithm is proposed to enhance speech contaminated by additive nonstationary noise. In this approach, a mixture hidden filter model (HFM) is used to model the clean speech and the noise process is modeled by a single hidden filter. The MIMM algorithm, however. needs large computation time because it is a recursive method based on multiple Kalman filters with mixture HFM. Thereby, a computationally efficient implementation of the algorithm is developed by exploiting the structure of the Kalman filtering equation. The simulation results show that the proposed method offers performance gain compared to the previous results in [4,5] with slightly increased complexity.
Analytic Derivation of the Finite Wordlength Effect of the Twiddle Factors in Recursive Implementation of the Sliding-DFT
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 48~53
This paper presents an analytic derivation of the erroneous effect when the sliding-DFT is implemented in a recursive way with the finite-bit approximation of the twiddle factors. The analysis result is obtained in a closed form equation of the noise-to-signal power ratio(NSR) employing the zero-mean white Gaussian signal as the target input of the DFT. The parameters of the wordlength used in representing the twiddle factors and the blocklength of the DFT appear in the NSR explicitly as its function variables. The derivation is based on the error dynamic equation which is derived from the recursive SDFT, and on the analytic exploration of the statistical characteristics of the approximation coefficients treating them as random variables of having spatial distributions. The analytically derived results are verified through the comparison with the data actually measured from the computer simulation experiment.
Propagation Speed of Torsional Elastic Waves In a Cylinder with a Periodically Corrugated Outer Surface
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 54~60
The paper describes a theoretical study on the speed of the torsional elastic waves propagating in a circular cylinder whose outer radius varies periodically as a harmonic function of the axial coordinate. The approximate solution for the phase speed has been obtained using the perturbation technique for sinusoidal modulation of small amplitude. It is shown that the wave speed in the cylinder with a corrugated outer surface is less than that in a smooth cylinder by the square of the amplitude of the surface perturbation. This theoretical prediction agrees reasonably with an experimental observation reported earlier. It is also shown that the wave speed reduction due to the surface corrugation becomes larger for a thinner cylinder and for a bigger density of corrugation.
An Algorithm on Optimum Weighting Design in Beamforming for Acoustic Measurement
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 61~67
This paper proposes a new beamforming algorithm for acoustic measurement by using the nested linear array. In this algorithm, the weighting is optimized by minimizing the LMS error with the initial value obtained by FIR filter design algorithm. The optimization process is applied to each sub-band, which is divided from the octave band, to produce the uniform directivity index. For the optimization pseudo inverse matrix is used for the transfer matrix. As the simulation results, it is found that the proposed algorithm can get the desired beam pattern and unform directivity index so as to be used efficiently for the acoustic measurement by using a nested linear array.
Voice and Image : A Perception Experiment
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 68~74
To investigate the relationship between voice and the image created by the voice, a perception experiment was undertaken in which voices of 8 males and 8 females reading a fairy tale were recorded and presented to 100 subjects along with the speakers' pictures. Two kinds of pictures were taken: one, a whole-body picture, and the other, a face-only close-up picture. Subjects were asked to match each voice with its owner. The results showed that out of 8, more than 4 were matched correctly. And the correct rate was higher when the voices were matched against the whole-body pictures than against the face-only pictures. In addition, it was noted that far more people picked a single voice as the most 'favorite' one than they picked one picture as the most 'favorite' image.
F0 Contour Model based on Temporal Decomposition
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 75~83
This paper proposes a new F0 contour model for intonation control in speech synthesis. We assume that the F0 contour of an utterance can be described using a sequence of time-overlapping events, which determine the fluctuation of a given F0 contour, described by asymmetric Gaussian functions. In addition, We propose a parameter estimation algorithm for the proposed model. The proposed model is not developed with a particular phonological theory in mind, and can be used in both F0 contour analysis and synthesis. For testing our F0 model, we collected 500 sentences from various genres and built a corresponding speech corpus uttered by a professional female announcer. As n result of F0 resynthesis experiment using the proposed model, the RMSE was 7.87Hz for given speech corpus.
A Study on the Determination of Grain Size of Heat-treated Stainless Steel Using Digital Ultrasonic Signal Processing Techniques.
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 84~93
Determination of grain size of heat-treated stainless steel based fm digital ultrasonic signal processing technique is presented. This techniques consist in evidence accumulation with multiple feature parameters, difference absolute mean value(DAMV), variance(VAR), mean frequency (MEANF), auto regressive model coefficient(ARC) and linear cepstrum coefficient(LCC). Feature parameters were extracted from ultrasonic echo signal of heat-treated metals. It was found that a few parameters might not be sufficient to exactly evaluate the grain size of heat-treated metals. The determination of grain size of heat-treated metals was carried out through the evidence accumulation procedure using the distances measured with reference parameters. A fuzzy mapping function is designed to transform the distances for the application of the evidence accumulation method. In the work presented, heat-treated stainless steel samples with various grain sizes are examined. The processed experimental results supports the feasibility of the grain size determination technique presented.
A Study on Hybrid Structure of Semi-Continuous HMM and RBF for Speaker Independent Speech Recognition
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 94~99
It is the hybrid structure of HMM and neural network(NN) that shows high recognition rate in speech recognition algorithms. And it is a method which has majorities of statistical model and neural network model respectively. In this study, we propose a new style of the hybrid structure of semi-continuous HMM(SCHMM) and radial basis function(RBF), which re-estimates weighting coefficients probability affecting observation probability after Baum-Welch estimation. The proposed method takes account of the similarity of basis Auction of RBF's hidden layer and SCHMM's probability density functions so as to discriminate speech signals sensibly through the learned and estimated weighting coefficients of RBF. As simulation results show that the recognition rates of the hybrid structure SCHMM/RBF are higher than those of SCHMM in unlearned speakers' recognition experiment, the proposed method has been proved to be one which has more sensible property in recognition than SCHMM.
A Study on the Propagation Prediction Model for the Microcell Mobile Communication
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 100~107
When a subscriber service composed along the central street of urban in microcell and picocell mobile communication of cellular method, we proposed the propagation prediction model that mobile communication environment of urban can analyze exactly and faster men than a precedent. We simulate the proposed propagation prediction model under the urban propagation environment of PCS mobile communication and analyze distribution of received field strength in cell. As a results, we show the optimal condition of the transmitting power and the position of the base station in the microcell and the picocell mobile communication.
A Study on the Car Audio Sound Quality Enhancement under Vehicle Noise and Its Subjective Evaluation
The Journal of the Acoustical Society of Korea, volume 18, issue 8, 1999, Pages 108~115
In this study we suggested a digital filter method to enhance car audio sound quality against the sound distortion due to cabin's acoustic characteristics and car driving noises. The digital filters designed were based on the characteristics on car driving noises and cabin acoustic characteristics. Car driving noises were analyzed by two ways; one is an objective method, octave band frequency analysis method. The other is a subjective method; sensory evaluation method, NCB method. On these results, seven sets of modified coefficients of eleven band digital filters were obtained. To find optimum audio sound quality among nine sound samples filtered by designing seven types of digital filters, which were mixed car driving noises at 100km/h, subjective evaluation method was used, paired comparison method; Scheffe' seven point method.