REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
The Journal of the Acoustical Society of Korea
Journal Basic Information
The Acoustical Society of Korea
Volume & Issues
Volume 11, Issue 6 - Dec 1992
Volume 11, Issue 4 - Aug 1992
Volume 11, Issue 3 - Jun 1992
Volume 11, Issue 2 - Apr 1992
Volume 11, Issue 1 - Feb 1992
Volume 11, Issue 1E - 00 1992
A Study on the Spoken Korean-Digit Recognition Using the Neural Network
Park, Hyun-Hwa ; Gahang, Hae Dong ; Bae, Keun Sung ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 5~13
Taking advantage of the fact that each Korean digit is a monosyllabic word, we propose a spoken Korean-digit recognition scheme using a multi-layer perceptron. Each spoken digit is divided into three segments (initial sound, medial vowel, and final consonant) based on the voice starting and ending points and a peak point in the middle of the vowel sound. Feature vectors such as cepstrum, reflection coefficients, and energy are extracted from each segment. Cepstrum, as an input vector to the neural network, is shown to give a higher recognition rate than reflection coefficients. Regression coefficients of the cepstrum affected the recognition rate less than we expected, which we believe is because the features were extracted from selected stationary segments of the input speech signal. With 150 cepstral coefficients obtained from each spoken digit, we achieved a correct recognition rate of 97.8%.
A Study on the Word Recognition of Korean Speech Using Neural Network - A Study on the Initial Consonant Recognition Using Composite Neural Network
Kim, Suk-Dong ; Lee, Haing-Sei ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 14~24
This paper studies consonant recognition using a neural network. First, the consonant part is separated from the consonant-vowel sound by means of acoustic parameters. The relationship between length and zero-crossing rate in the consonant sound is then studied by dividing the consonants into several groups. Finally, for consonant recognition, a composite neural network consisting of a control network and several sub-networks is proposed. The control network identifies the group to which the input consonant belongs, and the corresponding sub-network recognizes the consonant within that group.
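The two-stage idea in this abstract (a control stage picks a group, then a group-specific stage picks the consonant) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the function names, the group labels, and the use of plain callables in place of trained networks are hypothetical, and the zero-crossing rate shown is the common sign-change definition, which the abstract does not spell out.

```python
from typing import Callable, Dict, Sequence


def zero_crossing_rate(samples: Sequence[float]) -> float:
    """Fraction of adjacent sample pairs whose signs differ (zero counts as non-negative)."""
    if len(samples) < 2:
        return 0.0
    crossings = sum(
        1
        for prev, cur in zip(samples, samples[1:])
        if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(samples) - 1)


def composite_classify(
    features: Dict[str, float],
    control: Callable[[Dict[str, float]], str],
    subnets: Dict[str, Callable[[Dict[str, float]], str]],
) -> str:
    """Route the input through the control stage, then through the chosen sub-stage.

    `control` and the entries of `subnets` stand in for trained networks
    (hypothetical placeholders, not the authors' models).
    """
    group = control(features)
    return subnets[group](features)


# Toy stand-ins: a threshold rule instead of a control network, and
# constant outputs instead of sub-networks (labels are hypothetical).
def toy_control(features: Dict[str, float]) -> str:
    return "group_a" if features["zcr"] > 0.3 else "group_b"


toy_subnets = {
    "group_a": lambda features: "consonant_from_group_a",
    "group_b": lambda features: "consonant_from_group_b",
}

frame = [0.2, -0.1, 0.3, -0.4, 0.1]
label = composite_classify({"zcr": zero_crossing_rate(frame)}, toy_control, toy_subnets)
```

Splitting the decision this way means each sub-stage only has to separate the consonants inside one group, which is the motivation the abstract gives for the composite structure.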
Active Vibration Control of a Simply Supported Plate with Piezoelectric Sensors and Actuators - I. Theory
Roh, Yong-Rae ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 25~39
Undesired vibratory motion of a simply supported plate is controlled with piezoelectric sensors and actuators. Appropriate dynamic equations of the sensor and actuator are derived and coupled with the dynamic equation of the plate for the construction of an active feedback vibration control system. Analytic solutions are obtained for amplitude response of the plate, reflecting the combined effect of external driving forces and piezoelectric control moments. Numerical examples are presented to illustrate the effectiveness of this approach for two types of external forces, i.e. a concentrated point load and a piezoelectric plate driver. Calculation results show that the sensors and actuators can be efficient tools to mitigate the sensitivity of the structure to external sources of vibration. The method investigated in this work is applicable to arbitrary external loading conditions and control algorithms.
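As background for the plate model, the classical thin-plate relations for a simply supported rectangular plate can be sketched as follows. This is standard textbook material rather than the paper's own derivation. The symbols are assumptions introduced here (side lengths $a$ and $b$, thickness $h$, density $\rho$, Young's modulus $E$, Poisson's ratio $\nu$, transverse load $q$), and the piezoelectric control moments treated in the paper are not modeled.

```latex
\begin{align}
  % Governing equation of transverse motion w(x, y, t)
  D\,\nabla^4 w + \rho h\,\frac{\partial^2 w}{\partial t^2} &= q(x, y, t),
  \qquad D = \frac{E h^3}{12\,(1 - \nu^2)} \\
  % Modal expansion satisfying the simply supported boundary conditions
  w(x, y, t) &= \sum_{m=1}^{\infty} \sum_{n=1}^{\infty}
      W_{mn}(t)\,\sin\frac{m \pi x}{a}\,\sin\frac{n \pi y}{b} \\
  % Natural frequency of mode (m, n)
  \omega_{mn} &= \pi^2 \left[ \left(\frac{m}{a}\right)^2
      + \left(\frac{n}{b}\right)^2 \right] \sqrt{\frac{D}{\rho h}}
\end{align}
```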
Speaker-adaptive Word Recognition Using Mapped Membership Function
Lee, Ki-Yeong ; Choi, Kap-Seok ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 40~52
In this paper, we propose a speaker-adaptive word recognition method using a mapped membership function, in order to absorb the fluctuation due to differences between speakers, which is a problem in speaker-independent speech recognition. In the training procedure of this method, the mapped membership function between an unknown speaker's spectrum pattern and a standard speaker's is constructed by introducing fuzzy theory into a mapped codebook. In the recognition procedure, an input pattern of an unknown speaker is reconstructed by the mapped membership function into a pattern adapted to that of the standard speaker. To show the validity of this method, word recognition experiments are carried out using 28 DDD area names. The recognition rate of the conventional speaker-adaptive method using a mapped codebook by VQ is 64.9%, and that of the method using fuzzy VQ is 76.2%. With the mapped membership function, we achieve a recognition rate of 95.4%, which shows that the proposed method gives better recognition performance. Moreover, this method does not need an iterative training procedure to construct the mapped membership function, and its memory capacity and computation requirements are reduced to 1/30 and 1/500, respectively, of those of the conventional method using a mapped codebook.
An Adaptive Microphone Array with Linear Phase Response
Kang, Hong-Gu ; Youn, Dae-Hui ; Cha, Il-Hwan ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 53~60
Many adaptive beamforming methods have been studied for interference cancellation and speech signal enhancement in telephone conferences and auditoriums. Adaptive beamforming for speech signal processing differs from that for radar, sonar, and seismic signal processing because the desired output signal should be suited to the human ear. Considering that the human ear is quite insensitive to the phase of speech, Sondhi proposed a nonlinear constrained optimization technique whose constraint is on the magnitude transfer function from the source to the output. In a real environment, however, the phase response of the speech signal affects the human auditory system, so it is desirable to design a linear-phase system. In this paper, a linear-phase beamformer is proposed, together with a sample processing algorithm for real-time operation. Simulation results show that the proposed algorithm yields more consistent beam patterns and deeper nulls in the noise direction than Sondhi's.
A Comparative Study of Speech Parameters for Speech Recognition Neural Network
Kim, Ki-Seok ; Im, Eun-Jin ; Hwang, Hee-Yung ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 61~66
Much research has applied neural network models to automatic speech recognition, but the main trend has been to find neural network models and learning rules appropriate to the task. However, the choice of the input speech parameter is as important as the neural network model itself for improving the performance of a neural-network-based automatic speech recognition system. In this paper we select 6 speech parameters from a survey of speech recognition papers that use neural networks, and analyze their performance on the same data with the same neural network model. We use 8 sets of 9 Korean plosives and 18 sets of 8 Korean vowels. We use a recurrent neural network and compare the performance of the 6 speech parameters with the number of nodes held constant. The delta cepstrum of linear predictive coefficients gave the best result, with recognition rates of 95.1% for the vowels and 100.0% for the plosives.
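For readers unfamiliar with delta cepstrum, the sketch below shows one common way to compute it: a linear-regression slope of each cepstral coefficient over neighboring frames. The abstract does not state which formula the authors used, so the regression form, the window half-width `width`, and the edge handling (repeating the first and last frames) are all assumptions, and `delta_coefficients` is a hypothetical name.

```python
from typing import List, Sequence


def delta_coefficients(
    frames: Sequence[Sequence[float]], width: int = 2
) -> List[List[float]]:
    """Regression-based delta features for a sequence of cepstral frames.

    d[t][k] = sum_{n=1..width} n * (c[t+n][k] - c[t-n][k]) / (2 * sum_{n=1..width} n^2)

    Frames beyond either end are replaced by the first or last frame
    (an assumed edge-handling choice).
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    num_frames = len(frames)
    denom = 2.0 * sum(n * n for n in range(1, width + 1))
    deltas: List[List[float]] = []
    for t in range(num_frames):
        row: List[float] = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, num_frames - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        deltas.append(row)
    return deltas


# Usage: one coefficient per frame, rising by 1 each frame.
cepstra = [[0.0], [1.0], [2.0], [3.0], [4.0]]
deltas = delta_coefficients(cepstra, width=2)
```

For a coefficient that changes linearly over time, the interior delta values equal the per-frame slope, which is why such features capture the dynamics that static cepstra miss.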
An Electromagnetic Induction Underwater Acoustic Transducer Design And Its Characteristics
Park, Yoon-Kyu ; Ha, Kang-Iyeol ; Chang, Jee-Won ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 67~73
An electromagnetic induction transducer that radiates an impulsive ultrasonic wave could be useful as a high-power sound source in underwater acoustical surveys. The aim of this study is to investigate the waveforms and the spectral characteristics of the waves produced by a transducer built on the basis of the design criteria proposed by Eisenmenger. It was found that the sound pressure levels of the impulsive ultrasonic waves were directly proportional to the capacitance of the energy stored in the capacitor and inversely proportional to the thickness of the vibrating copper foil, while the center frequencies and the fractional bandwidths showed the opposite dependence.
A Review on the Acoustic Signal Processing for Sound Field Communication
Kang, Seong-Hoon ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 74~82
Estimation of Glottal Waveform
Lee, Jung-Chul ; Ann, Sou-Guil ;
The Journal of the Acoustical Society of Korea, volume 11, issue 3, 1992, Pages 83~93