Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
The Journal of the Acoustical Society of Korea
Journal Basic Information
Journal DOI :
The Acoustical Society of Korea
Editor in Chief :
Volume & Issues
Volume 28, Issue 8 - Nov 2009
Volume 28, Issue 7 - Oct 2009
Volume 28, Issue 6 - Aug 2009
Volume 28, Issue 5 - Jul 2009
Volume 28, Issue 4 - May 2009
Volume 28, Issue 3 - Apr 2009
Volume 28, Issue 2 - Feb 2009
Volume 28, Issue 1 - Jan 2009
Selecting the target year
Selection of Personalized Head Related Transfer Function Using a Binary Search tree
Lee, Ki-Seung ; Lee, Seok-Pil ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 409~415
The head-related transfer function (HRTF), which has an important role in virtual sound localization has different characteristics across the subjects. Measuring HRTF is very time-consuming and requires a set of specific apparatus. Accordingly, HRTF customization is often employed. In this paper, we propose a method to search an adequate HRTF from a set of the HRTFs. To achieve rapid and reliable customization of HRTF, all HRTFs in the database are partitioned, where a binary search tree was employed. The distortion measurement adopted in HRTF partitioning was determined in a heuristic way, which predicts the differences in perceived sound location well. The DC-Davis CIPIC HRTF database set was used to evaluate the effectiveness of the proposed method. In the listening test, where 10 subjects were participated, the stimuli filtered by the HRTF obtained by the proposed method were closer to those by the personalized HRTF in terms of sound localization. Moreover, performance of the proposed method was shown to be superior to the previous customization method, where the HRFT is selected by using anthropometric data.
Feasibility Study on Detection of Defective Elements in a Linear Phased Array Transducer through Ultrasonic Field Analysis and Visualization
Choi, Kwang-Yoon ; Yang, Jeong-Won ; Ha, Kang-Lyeol ; Kim, Moo-Joon ; Kim, Jung-Soon ; Lee, Chae-Bong ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 416~423
The ultrasonic pressure fields for the 3 MHz linear phased array transducer with sixteen piezoelectric elements of which one may not be operated by defect were simulated theoretically and measured experimentally using a visualization system of the Schlieren method. The simulation results for steering angles of
show that the side-lobe patterns of the transducer including a defective element is quite different from the transducer with all normal elements, and those patterns are in good agreement with the results of visualization. It is shown that the defective elements in a linear array transducer can be detected by comparison of the simulated and the visualized side-lobe patterns in two dimensional acoustic fields.
Sound Velocity Property of Sediment Containing Gas Hydrate in the Ulleung Basin, East Sea
Kim, Gil-Young ; Yoo, Dong-Geun ; Ryu, Byong-Jae ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 424~431
This study investigates the difference of sound velocity (compressional wave velocity) between gas hydrate-bearing sediments and nongas hydrate-bearing sediments in the Ulleung Basin, East Sea. We use a dataset measured from one site in the central part of the Ulleung Basin. Sound velocity for gas hydrate-bearing sediment shows the range from 1600 m/s to 2200 m/s. However, the value for nongas hydrate-bearing sediment is mostly around 1500 m/s, being less than 1400 m/s below 140 m subbottom depth. This trend is probably due to the presence of free gas below BSR (Bottom Simulating Reflector). Gas hydrate-bearing sediments show high value (maximum 150 Ohm-m) of resistivity. The physical properties between gas hydrate-bearing sediment and nongas hydrate-bearing sediment are characterized by the different patterns due to the presence of gas hydrate in comparison with those of marine unconsolidated sediments. Therefore, in order to investigate acoustic and physical properties for gas hydrate-bearing sediments, the study for the occurrence type and the amount of gas hydrates should be conducted simultaneously.
A Study on the OFDM System with Clipping Method for Underwater Communication
Han, Jeong-Woo ; Kim, Se-Young ; Kim, Ki-Man ; Jung, Ji-Won ; Kim, Seung-Joo ; Chun, Seung-Yong ; Son, Kwon ; Dho, Kyoung-Cheol ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 432~440
Recently, the necessity of underwater acoustic communication and demand for transmitting and receiving various data such as voice or high resolution image data are increasing as well. The performance of underwater acoustic communication system is influenced by underwater channel characteristic. Especially, a delay spread caused by reverberation and multi-path induces the ISI (Inter-Symbol-Interference) and reduces the communication performance. In this paper, we study the OFDM (Orthogonal Frequency Division Multiplexing) technique for robust the delay spread in underwater channel. We also use the clipping method to overcome the performance degradation in high PAPR (Peak-to-Average Power Ratio). We confirm the performances of underwater communication system by the underwater channel model simulation model and experiment in small water tank. As a result, the multi-carrier modulation with clipping method presented low BER and the previous single carrier modulation had high BER.
Automatic Indexing Algorithm of Golf Video Using Audio Information
Kim, Hyoung-Gook ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 441~446
This paper proposes an automatic indexing algorithm of golf video using audio information. In the proposed algorithm, the input audio stream is demultiplexed into the stream of video and audio. By means of Adaboost-cascade classifier, the continuous audio stream is classified into announcer's speech segment recorded in studio, music segment accompanied with players' names on TV screen, reaction segment of audience according to the play, reporter's speech segment with field background, filed noise segment like wind or waves. And golf swing sound including drive shot, iron shot, and putting shot is detected by the method of impulse onset detection and modulation spectrum verification. The detected swing and applause are used effectively to index action or highlight unit. Compared with video based semantic analysis, main advantage of the proposed system is its small computation requirement so that it facilitates to apply the technology to embedded consumer electronic devices for fast browsing.
Voice Activity Detection Method Using Psycho-Acoustic Model Based on Speech Energy Maximization in Noisy Environments
Choi, Gab-Keun ; Kim, Soon-Hyob ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 447~453
This paper introduces the method for detect voices and exact end point at low SNR by maximizing voice energy. Conventional VAD (Voice Activity Detection) algorithm estimates noise level so it tends to detect the end point inaccurately. Moreover, because it uses relatively long analysis range for reflecting temporal change of noise, computing load too high for application. In this paper, the SEM-VAD (Speech Energy Maximization-Voice Activity Detection) method which uses psycho-acoustical bark scale filter banks to maximize voice energy within frames is introduced. Stable threshold values are obtained at various noise environments (SNR 15 dB, 10 dB, 5 dB, 0 dB). At the test for voice detection in car noisy environment, PHR (Pause Hit Rate) was 100%accurate at every noise environment, and FAR (False Alarm Rate) shows 0% at SNR15 dB and 10 dB, 5.6% at SNR5 dB and 9.5% at SNR0 dB.
A Residual Echo and Noise Reduction Scheme with Linear Prediction for Hands-Free Telephony
Hwang, Kyung-Rok ; Son, Kyung-Sik ; Kim, Hyun-Tae ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 454~460
In this paper, we propose a residual echo and noise reduction scheme by using linear predictor for hands-free telephony applications. The proposed scheme whitens residual echo by the linear prediction during the non double-talk. But whitened residual echo signal still has speech characteristics. In this scheme, the whitened residual echo signal is more whitened by using the power of the linear prediction error signal and the linear predicted signal. After whitening process, near-end speech and ambient noise is present during double-talk but white noise will appear during non double-talk situation. By linearly predicting again the combined signal of the near-end speech and the whitened signal, the ambient noise is removed. Through computer simulation, it is shown that the proposed method performs well at the side of AIC (acoustic interference cancellation).
Performance Improvement of CPSP Based TDOA Estimation Using the Preemphasis
Kwon, Hong-Seok ; Bae, Keun-Sung ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 461~470
We investigate and analyze the problems encountered in frame-based estimation of TDOA (Time Difference of Arrival) using CPSP function. Spectral leakage occurring in framing of a speech signal by a rectangular window could make estimation of CPSP spectrum inaccurate. Framing with other windows to reduce the spectral leakage distorts the signal due to the asynchronous weighting around the frame specifically both ends of the frame. These problems degrade the performance of the CPSP-based TDOA estimation. In this paper, we propose a method to alleviate those problems by pre-emphasis of the speech signal. It reduces the influence of the spectral leakage by reducing dynamic range of the spectrum of a speech signal with pre-emphasis. To validate the proposed method of pre-emphasis, we carry out TDOA estimation experiments in various noise and reverberation conditions, Experimental results have shown that the framing of pre-emphasized microphone output by a rectangular window achieves higher success rate of TDOA estimation than any other framing methods.
Analysis and Implementation of Speech/Music Classification for 3GPP2 SMV Codec Employing SVM Based on Discriminative Weight Training
Kim, Sang-Kyun ; Chang, Joon-Hyuk ; Cho, Ki-Ho ; Kim, Nam-Soo ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 471~476
In this paper, we apply a discriminative weight training to a support vector machine (SVM) based speech/music classification for the selectable mode vocoder (SMV) of 3GPP2. In our approach, the speech/music decision rule is expressed as the SVM discriminant function by incorporating optimally weighted features of the SMV based on a minimum classification error (MCE) method which is different from the previous work in that different weights are assigned to each the feature of SMV. The performance of the proposed approach is evaluated under various conditions and yields better results compared with the conventional scheme in the SVM.
A Study on Robust Emotion Classification Structure Between Heterogeneous Speech Databases
Yoon, Won-Jung ; Park, Kyu-Sik ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 477~482
The emotion recognition system in commercial environments such as call-center undergoes severe system performance degradation and instability due to the speech characteristic differences between the system training database and the input speech of unspecified customers. In order to alleviate these problems, this paper extends traditional method of emotion recognition of neutral/anger into two-step hierarchical structure by using emotional characteristic changes and differences of male and female. The experimental results indicate that the proposed method provides very stable and successful emotional classification performance about 25% over the traditional method of emotion recognition.
Speech Interface with Echo Canceller and Barge- In Functionality for Telematic System
Kim, Jun ; Bae, Keun-Sung ;
The Journal of the Acoustical Society of Korea, volume 28, issue 5, 2009, Pages 483~490
In this paper, we develop a speech interface that has acoustic echo cancelling and barge-in functionalities in the car environment. In the echo canceller, DT (Double-Talk) detection algorithm using the correlation coefficients between reference and desired signals can make DT detection errors often in the background noise. We reduce the DT detection errors by using the average power of noise and echo estimated from the input signal. In addition, to make it possible for drivers to give speech command to the system by interrupting the speaker output, barge-in functionality is implemented with the combination of DT detection and appropriate gain control of the speaker output. Through the computer simulation with the assumed car environment and experiment in the real laboratory environment, implemented speech interface has shown good performance in removing acoustic echo signals in the noisy environment with proper operation of barge-in functionality.