Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
The Journal of the Acoustical Society of Korea
Journal Basic Information
Journal DOI :
The Acoustical Society of Korea
Editor in Chief :
Volume & Issues
Volume 35, Issue 4 - Jul 2016
Volume 35, Issue 3 - May 2016
Volume 35, Issue 2 - Mar 2016
Volume 35, Issue 1 - Jan 2016
Selecting the target year
Centroid-model based music similarity with alpha divergence
Seo, Jin Soo ; Kim, Jeonghyun ; Park, Jihyun ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 83~91
DOI : 10.7776/ASK.2016.35.2.083
Music-similarity computation is crucial in developing music information retrieval systems for browsing and classification. This paper overviews the recently-proposed centroid-model based music retrieval method and applies the distributional similarity measures to the model for retrieval-performance evaluation. Probabilistic distance measures (also called divergence) compute the distance between two probability distributions in a certain sense. In this paper, we consider the alpha divergence in computing distance between two centroid models for music retrieval. The alpha divergence includes the widely-used Kullback-Leibler divergence and Bhattacharyya distance depending on the values of alpha. Experiments were conducted on both genre and singer datasets. We compare the music-retrieval performance of the distributional similarity with that of the vector distances. The experimental results show that the alpha divergence improves the performance of the centroid-model based music retrieval.
Interior surface treatment guidelines for classrooms according to the acoustical performance criteria
Ryu, Da-Jung ; Park, Chan-Jae ; Haan, Chan-Hoon ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 92~101
DOI : 10.7776/ASK.2016.35.2.092
There are many results in which acoustical conditions of a classroom play an important role for studying effects and academic achievement of students. However, there are very few guidelines or design proposals which could make appropriate acoustic environment when classrooms are built or renovated. The present study suggests various design proposals satisfying acoustic standards of classrooms based on theoretical calculation and acoustic field experiments. At first, minimum area of sound absorption was calculated which is required to satisfy the acoustic standard for domestic middle and high schools. Also, room acoustic measurements were carried out in order to investigate the acoustic performance of an existing classroom by changing interior finishing materials on ceiling and rear walls. As a result, it was revealed that reverberation time standard below 0.8 s can be acquired even if there is no sound absorption on ceiling which is a general practice executed in Korea. Specially, it was found that if partial area of ceiling would be treated as reflective with the ratio of sound absorption and reflection as 2:1, almost similar acoustic parameters of
, RASTI (Rapid Speech Transmission Index) and higher sound levels could be acquired in comparison with the case of entire sound absorption on ceiling.
Multi frequency band noise suppression system using signal-to-noise ratio estimation
Oh, In Kyu ; Lee, In Sung ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 102~109
DOI : 10.7776/ASK.2016.35.2.102
This paper proposes a noise suppression method through SNR (Singal-to Noise Ratio) estimation in the two microphone array environment of close spacing. The conventional method uses a noise suppression method for a gain function obtained through the SNR estimation based on coherence function from full band. However, this method cause performance decreased by the noise damage that affects all the feature vector component. So, we propose a noise suppression method that allocates a frequency domain signal into N constant multi frequency band and each frequency band gets a gain function through SNR estimation based on coherence function. Performance evaluation of the proposed method is shown by comparison with PESQ (Perceptual Evaluation of Speech Quality) value which is an objective quality evaluation method provided by the ITU-T (International Telecommunications Union Telecommunication).
Separation of passive sonar target signals using frequency domain independent component analysis
Lee, Hojae ; Seo, Iksu ; Bae, Keunsung ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 110~117
DOI : 10.7776/ASK.2016.35.2.110
Passive sonar systems detect and classify the target by analyzing the radiated noises from vessels. If multiple noise sources exist within the sonar detection range, it gets difficult to classify each noise source because mixture of noise sources are observed. To overcome this problem, a beamforming technique is used to separate noise sources spatially though it has various limitations. In this paper, we propose a new method that uses a FDICA (Frequency Domain Independent Component Analysis) to separate noise sources from the mixture. For experiments, each noise source signal was synthesized by considering the features such as machinery tonal components and propeller tonal components. And the results of before and after separation were compared by using LOFAR (Low Frequency Analysis and Recording), DEMON (Detection Envelope Modulation On Noise) analysis.
Contribution analysis of underwater radiation noise source using partial coherence function
Kim, Tae Hyeong ; Choi, Jae Yong ; Oh, Jun Seok ; Kim, Seong Yong ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 118~124
DOI : 10.7776/ASK.2016.35.2.118
In this paper, contribution analysis method using a partial coherence function is dealt with in the case of underwater radiation noise. When performing the contribution analysis using a partial coherence function, it is important to select the order of system input. But in the case of frequency correlated systems, it is very difficult to properly select the order of system input. In order to solve this problem, the contribution analysis is performed by subdividing the area of contribution using multiple coherence function. And the new contribution analysis method is presented by using the relationship between the contribution characteristic matrix and multiple coherence function. In order to validate the new method, calculation is performed about multi-input / single-output model which is composed of sine waves. The result of calculation shows that it is possible to derive the exact contribution values.
An analysis of the moving speed effect of the receiver array on the passive synthetic aperture signal processing
Kim, Sea-Moon ; Byun, Sung-Hoon ; Oh, Sehyun ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 125~133
DOI : 10.7776/ASK.2016.35.2.125
In order to obtain high-resolution seafloor images, research on SA (Synthetic Aperture) processing and the development of related underwater systems have been performed in many countries. Recently the SA processing is also recognized as an important technique in Korea and researchers started related basic study. However, most previous studies ignored the Doppler effect by a moving receiver array. In this paper reconstructed SAS (Synthetic Aperture Sonar) images and position errors are analyzed according to the speed of a moving array for understanding its moving effect on the SAS images. In the analysis the spatial frequency domain interpolation algorithm is used. The results show that as the moving speed of the array increases the estimated position error also increases and image distortion gets worse when we do not consider the array motion. However, if the compensated receiver signals considering the array motion are used the position error and image distortion can be eliminated. In conclusion a signal processing scheme which compensates the Doppler effect is necessary especially in the condition where the array speed is over 1 m/s.
Speech synthesis using acoustic Doppler signal
Lee, Ki-Seung ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 134~142
DOI : 10.7776/ASK.2016.35.2.134
In this paper, a method synthesizing speech signal using the 40 kHz ultrasonic signals reflected from the articulatory muscles was introduced and performance was evaluated. When the ultrasound signals are radiated to articulating face, the Doppler effects caused by movements of lips, jaw, and chin observed. The signals that have different frequencies from that of the transmitted signals are found in the received signals. These ADS (Acoustic-Doppler Signals) were used for estimating of the speech parameters in this study. Prior to synthesizing speech signal, a quantitative correlation analysis between ADS and speech signals was carried out on each frequency bin. According to the results, the feasibility of the ADS-based speech synthesis was validated. ADS-to-speech transformation was achieved by the joint Gaussian mixture model-based conversion rules. The experimental results from the 5 subjects showed that filter bank energy and LPC (Linear Predictive Coefficient) cepstrum coefficients are the optimal features for ADS, and speech, respectively. In the subjective evaluation where synthesized speech signals were obtained using the excitation sources extracted from original speech signals, it was confirmed that the ADS-to-speech conversion method yielded 72.2 % average recognition rates.
Sound event classification using deep neural network based transfer learning
Lim, Hyungjun ; Kim, Myung Jong ; Kim, Hoirin ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 143~148
DOI : 10.7776/ASK.2016.35.2.143
Deep neural network that effectively capture the characteristics of data has been widely used in various applications. However, the amount of sound database is often insufficient for learning the deep neural network properly, so resulting in overfitting problems. In this paper, we propose a transfer learning framework that can effectively train the deep neural network even with insufficient sound event data by employing rich speech or music data. A series of experimental results verify that proposed method performs significantly better than the baseline deep neural network that was trained only with small sound event data.
A clutter reduction algorithm based on clustering for active sonar systems
Kwak, ChulHyun ; Cheong, Myoung Jun ; Ahn, Jae-Kyun ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 149~157
DOI : 10.7776/ASK.2016.35.2.149
In this paper, we propose a new clutter reduction algorithm, which rejects heavy clutter density in shallow water environments, based on a clustering method. At first, it applies the density-based clustering to active sonar measurements by considering speed of targets, pulse repetition intervals, etc. We assume clustered measurements as target candidates and remove noise, which is a set of unclustered measurements. After clustering, we classify target and clutter measurements by the validation check method. We evaluate the performance of the proposed algorithm on synthetic data and sea-trial data. The results demonstrate that the proposed algorithm provides significantly better performances to reduce clutter than the conventional algorithm.
A study on the features of resolution of robust active sonar pulses from the reverberation environment
Jeong, Eui Cheol ;
The Journal of the Acoustical Society of Korea, volume 35, issue 2, 2016, Pages 158~165
DOI : 10.7776/ASK.2016.35.2.158
Many researches for improving detection performance in the reverberation environment have been conducted in active sonar systems. Especially the type of active pulse makes an impact on the detection performance in the reverberation environment. Thus, this paper describes the detection performance of PTFM (Pulse Trains of Frequency Modulated waveform), Costas, and Geometric Comb pulses which are known for their outstanding performance against the reverberation. Sea trial data of those pulses was analyzed and it was figured out that the range resolution of PTFM pulse was deteriorated by its sub-pulses. Costas pulse showed performance degradations of the doppler resolution by multipath signals. Geometric Comb pulse showed the best doppler resolution.