Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
The Journal of the Acoustical Society of Korea
Journal Basic Information
Journal DOI :
The Acoustical Society of Korea
Editor in Chief :
Volume & Issues
Volume 22, Issue 8 - Nov 2003
Volume 22, Issue 7 - Oct 2003
Volume 22, Issue 6 - Aug 2003
Volume 22, Issue 5 - Jul 2003
Volume 22, Issue 4 - May 2003
Volume 22, Issue 3 - Apr 2003
Volume 22, Issue 2 - Feb 2003
Volume 22, Issue 1 - Jan 2003
Volume 22, Issue 1E - 00 2003
Selecting the target year
Performance Analysis of MVDR and RLS Beamforming Using Systolic Array Structure
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 1~6
This paper analyses the performance of either the minimum variance distortionless response (MVDR) or the recursive least square (RLS) beamformer structured on the systolic array. Provided that the snapshot vector including the desired user's signal and the interferences with the noise is received at the array antenna. In order to improve the quality of received signal, MVDR or RLS algorithm can be utilized to update the beamformer weights recursively. Furthermore to increase the channel capacity, by the usage of the above schemes, the effect of the spatial filtering can be obtained which constructively combining multipath components corresponding to the desired user whereas the multiple access interferences (MAI) is nulled out on spatial domain. This paper introduces the MVDR and RLS beamformer structured on systolic array conducting the spatial filtering, and its performance under the multipath fading channel in the presence of multiple access interferences will be analyzed. To show the superior spatial filtering performances of the proposed scheme employing the systolic way structured beamformer, the computer simulations are carried out. And the validity of practical deployment of the proposed scheme will be confirmed throughout showing the BER behaviors and the beampatterns.
Stereo Vision Neural Networks with Competition and Cooperation for Phoneme Recognition
김성일 ; 정현열 ;
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 3~3
This paper describes two kinds of neural networks for stereoscopic vision, which have been applied to an identification of human speech. In speech recognition based on the stereoscopic vision neural networks (SVNN), the similarities are first obtained by comparing input vocal signals with standard models. They are then given to a dynamic process in which both competitive and cooperative processes are conducted among neighboring similarities. Through the dynamic processes, only one winner neuron is finally detected. In a comparative study, with, the average phoneme recognition accuracy on the two-layered SVNN was 7.7% higher than the Hidden Markov Model (HMM) recognizer with the structure of a single mixture and three states, and the three-layered was 6.6% higher. Therefore, it was noticed that SVNN outperformed the existing HMM recognizer in phoneme recognition.
Effect of Higher Order Form Factors on the Prediction of Room Acoustics by Extended Radiosity Method
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 7~13
Numerous investigations have demonstrated that diffused reflection is one of the most important factors in predicting room acoustics by computer simulation. Recent studies have suggested several computational algorithms in order to account for diffused reflections in the ray tracing or beam tracing method. In this study, a computational algorithm for the calculation of diffuse sound reflections in the image method is suggested and a computer simulation system is developed based on the suggested algorithm. The methodology adopted in our computer simulation system is similar to the extended radiosity method, which is developed for the computer graphics. In a real room, sound energy is reflected in a partially diffused manner which results in four reflection combinations: diffuse-diffuse, specular-specular, diffuse-specular and specular-diffuse. In this study, higher order form factor is introduced to handle the four types of reflection combinations so that the partially diffused reflection could be modeled. In this paper, the concept of extended radiosity method is described and the approximate method of calculating higher order form factor is suggested. Finally, the effect of higher order form factors on the simulation of reverberation time is investigated.
Exterior Acoustic Holography Reconstruction of a Tuning Fork Using Inverse Non-singular BEM
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 11~11
Non-singular boundary element method (BEM) codes are developed in acoustics application. The BEM code is then used to calculate unknown boundary surface normal displacements and surface pressures from known exterior near field pressures. And then the calculated surface normal displacements and surface pressures are again applied to the BEM in forward in order to calculate reconstructed field pressures. The initial exterior near field pressures are very well agreed with the later reconstructed field pressures. Only the same number of boundary surface nodes (1178) are used for the initial exterior pressures which are at first calculated by Finite Element Method (FEM) and BEM. Pseudo-inverse technique is, used for the calculation of the unknown boundary surface normal displacements. The structural object is a tuning fork with 128.4 ㎐ resonant. The boundary element is a quadratic hexahedral element (eight nodes per element).
Effects of Speaking Rate on Korean Vowels
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 14~22
In this study, we examined the acoustic characteristics of Korean vowels through a production test under three conditions of speaking rates (slow, normal, fast). The effects of a change in speaking .ate on vowel duration were found to be very strong. The faster speaking rate was, the shorter the total duration of vowels was. But the duration ratio of two components of diphthong was not changed significantly according to changes in speaking rate. But unlike the temporal aspects, the formant value of vowels at their steady-state and change ratio of formant of semivowels were not affected strongly by the change in speaking rate.
VFF-PASTd Based Multiple Target Angle Tracking with Angular Innovation
임준석 ; 최용진 ; 윤석준 ;
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 19~19
Ryu et al. recently proposed a multiple target angle-tracking algorithm without a data association problem. This algorithm, however, shows the degraded performance on evasive maneuvering targets, because the estimated signal subspace is d,:graded in the algorithm. In this Paper, we proposed a new algorithm, in which VFF-PASTd (Variable Forgetting Factor PASTd) algorithm is applied to Ryu's algorithm to effectively handle the evasive target tracking with better time-varying signal subspace.
Effective Syllable Modeling for Korean Speech Recognition Using Continuous HMM
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 23~27
Recently attempts to we the syllable as the recognition unit to enhance performance in continuous speech recognition hate been reported. However, syllables are worse in their trainability than phones and the former have a disadvantage in that contort-dependent modeling is difficult across the syllable boundary since the number of models is much larger for syllables than for phones. In this paper, we propose a method to enhance the trainability for the syllables in Korean and phoneme-context dependent syllable modeling across the syllable boundary. An experiment in which the proposed method is applied to word recognition shows average 46.23% error reduction in comparison with the common syllable modeling. The right phone dependent syllable model showed 16.7% error reduction compared with a triphone model.
Formulation of New Hyperbolic Time-shift Covariant Time-frequency Symbols and Its Applications
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 26~26
We propose new time-frequency (TF) tools for analyzing linear time-varying (LTV) systems and nonstationary random processes showing hyperbolic TF structure. Obtained through hyperbolic warping the narrowband Weyl symbol (WS) and spreading function (SF) in frequency, the new TF tools are useful for analyzing LTV systems and random processes characterized by hyperbolic time shifts. This new TF symbol, called the hyperbolic WS, satisfies the hyperbolic time-shift covariance and scale covariance properties, and is useful in wideband signal analysis. Using the new, hyperbolic time-shift covariant WS and 2-D TF kernels, we provide a formulation for the hyperbolic time-shift covariant TF symbols, which are 2-D smoothed versions of the hyperbolic WS. We also propose a new interpretation of linear signal transformations as weighted superposition of hyperbolic time shifted and scale changed versions of the signal. Application examples in signal analysis and detection demonstrate the advantages of our new results.
A Study on the Robust Bimodal Speech-recognition System in Noisy Environments
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 28~34
Recent researches have been focusing on jointly using lip motions (i.e. visual speech) and speech for reliable speech recognitions in noisy environments. This paper also deals with the method of combining the result of the visual speech recognizer and that of the conventional speech recognizer through putting weights on each result: the paper proposes the method of determining proper weights for each result and, in particular, the weights are autonomously determined, depending on the amounts of noise in the speech and the image quality. Simulation results show that combining the audio and visual recognition by the proposed method provides the recognition performance of 84% even in severely noisy environments. It is also shown that in the presence of blur in images, the newly proposed weighting method, which takes the blur into account as well, yields better performance than the other methods.
An Improved Secondary Path Modeling Method by Modified Kuo Model
박병욱 ; 김학윤 ;
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 33~33
Kuo et al proposed an on-line method for an adaptive prediction error filter for improving secondary path modeling performance in the modeling method of the secondary path. This method have some disadvantages, namely having to use additive noise with the result that noise control performance is not good since it is focused on the estimated performance of the secondary path. In this paper, we proposes a modified Kuo model using gain control parameter and delay. It uses a reference signal for additive noise to improve the problems in the existing Kuo model.
Feature Extraction by Optimizing the Cepstral Resolution of Frequency Sub-bands
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 35~41
Feature vectors for conventional speech recognition are usually extracted in full frequency band. Therefore, each sub-band contributes equally to final speech recognition results. In this paper, feature Teeters are extracted indepedently in each sub-band. The cepstral resolution of each sub-band feature is controlled for the optimal speech recognition. For this purpose, different dimension of each sub-band ceptral vectors are extracted based on the multi-band approach, which extracts feature vector independently for each sub-band. Speech recognition rates and clustering quality are suggested as the criteria for finding the optimal combination of sub-band Teeter dimension. In the connected digit recognition experiments using TIDIGITS database, the proposed method gave string accuracy of 99.125%, 99.775% percent correct, and 99.705% percent accuracy, which is 38%, 32% and 37% error rate reduction relative to baseline full-band feature vector, respectively.
A New Teat Data Generation for SPRT in Speaker Verification
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 42~47
This paper proposes the method to generate new test data using the sample shift of the start frame for SPRT(sequential probability ratio test) in speaker verification. The SPRT method is a effective algorithm that can reduce the test computational complexity. However, in making the decision procedure, SPRT can be executed on the assumption that the input samples are usually to be i.i.d. (Independent and Identically Distributed) samples from a probability density function (pdf), also it's not suitable method to apply for the short utterance. The proposed method can achieve SPRT regardless of the utterance length of the test data because it is method to generate the new test data through the sample shift of start frame. Also, the correlation property of data to be considered in the SPRT method can be effectively removed by employing the principal component analysis. Experimental results show that the proposed method increased the computational complexity of data for sample shift a little, but it has a good performance result more than a conventional method above the average 0.7% in EER (equal error rate).
Acoustic Channel Compensation at Mel-frequency Spectrum Domain
정소영 ; 오상훈 ; 이수영 ;
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 43~43
The effects of linear acoustic channels have been analyzed and compensated at mel-frequency feature domain. Unlike popular RASTA filtering our approach incorporates separate filters for each mel-frequency band, which results in better recognition performance for heavy-reverberated speeches.
Performance Improvement of Fast Speaker Adaptation Based on Dimensional Eigenvoice and Adaptation Mode Selection
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 48~53
Eigenvoice method is known to be adequate for fast speaker adaptation, but it hardly shows additional improvement with increased amount of adaptation data. In this paper, to deal with this problem, we propose a modified method estimating the weights of eigenvoices in each feature vector dimension. We also propose an adaptation mode selection scheme that one method with higher performance among several adaptation methods is selected according to the amount of adaptation data. We used POW DB to construct the speaker independent model and eigenvoices, and utterances(ranging from 1 to 50) from PBW 452 DB and the remaining 400 utterances were used for adaptation and evaluation, respectively. With the increased amount of adaptation data, proposed dimensional eigenvoice method showed higher performance than both conventional eigenvoice method and MLLR. Up to 26% of word error rate was reduced by the adaptation mode selection between eigenvoice and dimensional eigenvoice methods in comparison with conventional eigenvoice method.
Calculation of Radiation Impedance for Piston Sources on a Spherical Baffle
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 54~60
The characteristics of radiation impedance for piston source on a spherical baffle are analyzed by algorithms which consists of Finite Element Method (FEM) and Hybrid type Infinite Element Method (HIEM). The results of self-radiation impedance for radiation angle and mutual radiation impedance between piston sources coincided with other reports on the spherical rigid baffle. For the spherical non-rigid baffles, the variations of self-radiation impedance and mutual-radiation impedance are identified. Therefore, these results can be applied to design and radiation characteristics analysis of acoustic transducers.
Determination of Phase Velocity Dispersion Curve and Group Velocity of lamb Waves Using Backward Radiation
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 61~68
The guided wave has been widely employed to characterize thin plates and layered media. The dispersion curves of phase and group velocities are essential for the quantitative application of guided waves. In the present work, a fully automated system for the measurement of backward radiation of LLW has been developed. The specimen moves in two dimensional plane as well as in angular rotation. The signals of backward radiation of LLW were measured from an elastic plate in which specific modes of Lamb wave were strongly generated. Phase velocity of the corresponding modes was determined from the incident angle. The generated Lamb waves propagated forward and backward with the leakage of energy into water. Backward radiated LLW was detected by the same transducer and its frequency components were analyzed to extract the related information to the dispersion curves. The dispersion curves of phase velocity were measured by varying the incident angle. Moving the specimen in the linear direction of LLW propagation, group velocity was determined by measuring the transit time shift in the ultrasonic waveform.
Variability of Vertical Distribution of Volume Scattering Observed in the Shallow Water
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 69~77
Measurements of backscattered intensity were made over a shallow water using 300 ㎑and 1200 ㎑ bottom mounted ADCP (Acoustic Doppler Current Profiler) to determine the temporal variability of vertical distribution of high-frequency volume scattering strength (Sv). The variability of Sv in relatively deep water column(85 m and 113 m was due to the daily vertical migration, probably of larger zooplankton. However it was not found with 1200㎑ data at shallow water column. From the empirical orthogonal function (EOF) analysis using 1200㎑ data, the vertical distribution of the first mode eigenvectors of Sv is characterized by the presence of the maximum values near the bottom of the water.
Measurements of Backscattering Strength from Various Shapes of Sediment Surfaces and Layers
The Journal of the Acoustical Society of Korea, volume 22, issue 1, 2003, Pages 78~87
High-frequency (126-㎑) bottom backscattering measurements with various bottom types were conducted at the water tank in Ocean Acoustic Laboratory, Hanyang University. For the purpose of investigating the energy distribution of bottom scattering with various bottom types, the sediment was varied with gravel, sand, sandy mud and mixed bottoms. To examine the anisotropic nature of the scattering due to the orientations of bottom ripple, the footprints were made transverse and longitudinal to the direction of incident wave. The total scattering characteristics are that the larger grazing angles the larger backscattering strengths become and backscattering strengths for a transverse ripple case are higher than those of longitudinal ripple case. finally, the variations of scattering strength depend mainly on the ripple's orientation.