Search | Korea Science

Audio Context Recognition Using Signal's Reconstructed Phase Space (신호의 복원된 위상 공간을 이용한 오디오 상황 인지)

Vinh, La The;Khattak, Asad Masood;Loan, Trinh Van;Lee, Sungyoung;Lee, Young-Ko
- Proceedings of the Korea Information Processing Society Conference
- /
- 2009.11a
- /
- pp.243-244
- /
- 2009
So far, many researches have been conducted in the area of audio based context recognition. Nevertheless, most of them are based on existing feature extraction techniques derived from linear signal processing such as Fourier transform, wavelet transform, linear prediction... Meanwhile, environmental audio signal may potentially contains non-linear dynamic properties. Therefore, it is a big potential to utilize non-linear dynamic signal processing techniques in audio based context recognition.
https://doi.org/10.3745/PKIPS.y2009m11a.243 인용 PDF

Design and Fabrication of VTR Audio Signal Processor IC (VTR 음성신호 처리용 집적회로의 설계 및 제작)

Shin, Myung-Chul
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.24 no.4
- /
- pp.618-624
- /
- 1987
This paper describes the design and fabrication of a signal processing integrated circuit required for the recording and playback of VTR audio signal. The integrated circuit was designed using 8\ulcorner design rule and its chip size is 2.5x2.5mm\ulcorner It was fabricated using SST bipolar standard process technology. The measurement analysis of the fabricated circuit proves the satisfactory DC characteristics and its proper audio signal processing funcstion.
PDF

A Single-Chip Video/Audio CODEC for Low Bit Rate Application

Park, Seong-Mo;Kim, Seong-Min;Kim, Ig-Kyun;Byun, Kyung-Jin;Cha, Jin-Jong;Cho, Han-Jin
- ETRI Journal
- /
- v.22 no.1
- /
- pp.20-29
- /
- 2000
In this paper, we present a design of video and audio single chip encoder/decoder for portable multimedia application. The single-chip called as video audio signal processor (VASP) consists of a video signal processing block and an audio single processing block. This chip has mixed hardware/software architecture to combine performance and flexibility. We designed the chip by partitioning between video and audio block. The video signal processing block was designed to implement hardware solution of pixel input/output, full pixel motion estimation, half pixel motion estimation, discrete cosine transform, quantization, run length coding, host interface, and 16 bits RISC type internal controller. The audio signal processing block is implemented with software solution using a 16 bits fixed point DSP. This chip contains 142,300 gates, 22 Kbits FIFO, 107 kbits SRAM, and 556 kbits ROM, and the chip size is $9.02mm{\times}9.06mm$ which is fabricated using 0.5 micron 3-layer metal CMOS technology.
PDF

High Embedding Capacity and Robust Audio Watermarking for Secure Transmission Using Tamper Detection

Kaur, Arashdeep;Dutta, Malay Kishore
- ETRI Journal
- /
- v.40 no.1
- /
- pp.133-145
- /
- 2018
Robustness, payload, and imperceptibility of audio watermarking algorithms are contradictory design issues with high-level security of the watermark. In this study, the major issue in achieving high payload along with adequate robustness against challenging signal-processing attacks is addressed. Moreover, a security code has been strategically used for secure transmission of data, providing tamper detection at the receiver end. The high watermark payload in this work has been achieved by using the complementary features of third-level detailed coefficients of discrete wavelet transform where the human auditory system is not sensitive to alterations in the audio signal. To counter the watermark loss under challenging attacks at high payload, Daubechies wavelets that have an orthogonal property and provide smoother frequencies have been used, which can protect the data from loss under signal-processing attacks. Experimental results indicate that the proposed algorithm has demonstrated adequate robustness against signal processing attacks at 4,884.1 bps. Among the evaluators, 87% have rated the proposed algorithm to be remarkable in terms of transparency.
https://doi.org/10.4218/etrij.2017-0092 인용 PDF KSCI

Serial Transmission of Audio Signals for Multi-channel Speaker Systems (다채널 스피커 시스템을 위한 오디오 신호지 직렬 전송)

Kwon, Oh-Kyun;Song, Moon-Vin;Lee, Seung-Won;Lee, Young-Won;Chung, Yun-Mo
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.7
- /
- pp.387-394
- /
- 2005
In this paper, we propose a new transmission technique of audio signals for the serial connection of the speakers of multiple-channel audio systems. Analog audio signals from a multi-channel audio system are converted into digital signals with signal processing steps and transferred to each speaker through a serial line. The signal processing steps contain data compression and packet generation in association with audio signal characteristics. Each speaker gets its corresponding digital audio signals from the transmitted packets and converts the signals into analog audio signals to make sounds with the speaker All the proposed functions in this paper are modeled in VHDL. implemented with FPGA chips, and tested for actual multi-channel audio systems.
PDF KSCI

An Efficient Audio Watermark Extraction in Time Domain

Kang, Hae-Won;Jung, Sung-Hwan
- Journal of Information Processing Systems
- /
- v.2 no.1
- /
- pp.13-17
- /
- 2006
In this paper, we propose an audio extraction method to decrease the influence of the original signal by modifying the watermarking detection system proposed by P. Bassia et al. In the extraction of the watermark, we employ a simple mean filter to remove the influence of the original signal as a preprocessing of extraction and the repetitive insertion of the watermark. As the result of the experiment, for which we used about 20 kinds of actual audio data, we obtain a watermark detection rate of about 95% and a good performance even after the various signal processing attacks.
https://doi.org/10.3745/JIPS.2006.2.1.013 인용 PDF KSCI

A Study on the Signal Processing for Content-Based Audio Genre Classification (내용기반 오디오 장르 분류를 위한 신호 처리 연구)

윤원중;이강규;박규식
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.271-278
- /
- 2004
In this paper, we propose a content-based audio genre classification algorithm that automatically classifies the query audio into five genres such as Classic, Hiphop, Jazz, Rock, Speech using digital sign processing approach. From the 20 seconds query audio file, the audio signal is segmented into 23ms frame with non-overlapped hamming window and 54 dimensional feature vectors, including Spectral Centroid, Rolloff, Flux, LPC, MFCC, is extracted from each query audio. For the classification algorithm, k-NN, Gaussian, GMM classifier is used. In order to choose optimum features from the 54 dimension feature vectors, SFS(Sequential Forward Selection) method is applied to draw 10 dimension optimum features and these are used for the genre classification algorithm. From the experimental result, we can verify the superior performance of the proposed method that provides near 90% success rate for the genre classification which means 10%∼20% improvements over the previous methods. For the case of actual user system environment, feature vector is extracted from the random interval of the query audio and it shows overall 80% success rate except extreme cases of beginning and ending portion of the query audio file.
PDF KSCI

A Beamforming-Based Video-Zoom Driven Audio-Zoom Algorithm for Portable Digital Imaging Devices

Park, Nam In;Kim, Seon Man;Kim, Hong Kook;Kim, Myeong Bo;Kim, Sang Ryong
- IEIE Transactions on Smart Processing and Computing
- /
- v.2 no.1
- /
- pp.11-19
- /
- 2013
A video-zoom driven audio-zoom algorithm is proposed to provide audio zooming effects according to the degree of video-zoom. The proposed algorithm is designed based on a super-directive beamformer operating with a 4-channel microphone array in conjunction with a soft masking process that uses the phase differences between microphones. The audio-zoom processed signal is obtained by multiplying the audio gain derived from the video-zoom level by the masked signal. The proposed algorithm is then implemented on a portable digital imaging device with a clock speed of 600 MHz after different levels of optimization, such as algorithmic level, C-code and memory optimization. As a result, the processing time of the proposed audio-zoom algorithm occupies 14.6% or less of the clock speed of the device. The performance evaluation conducted in a semi-anechoic chamber shows that the signals from the front direction can be amplified by approximately 10 dB compared to the other directions.
PDF

Modeling and Analysis of Class D Audio Amplifiers using Control Theories (제어이론을 이용한 D급 디지털 오디오 증폭기의 모델링과 해석)

Ryu, Tae-Ha;Ryu, Ji-Yeol;Doh, Tae-Yong
- Journal of Institute of Control, Robotics and Systems
- /
- v.13 no.4
- /
- pp.385-391
- /
- 2007
A class D digital audio amplifier with small size, low cost, and high quality is positively necessary in the multimedia era. Since the digital audio amplifier is based on the PWM signal processing, it is improper to analyze the principle of signal generation using linear system theories. In this paper, a class D digital audio amplifier based ADSM (Advanced Delta-Sigma Modulation) is considered. We first model the digital audio amplifier and then explain the operation principle using variable structure control algorithm. Moreover, the ripple signal generated by the hysteresis in the comparator has a significant effect on the system performance. Thus, we present a method to find the magnitude and the frequency of the ripple signal using describing function. Finally, simulations and experiments are provided to show the validity of the proposed methods.
https://doi.org/10.5302/J.ICROS.2007.13.4.385 인용 PDF KSCI

Audio Watermarking Using Empirical Mode Decomposition (경험적 모드 분해법을 이용한 오디오 워터마킹)

Nguyen, Phuong;Kim, Jong-Myon
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2014.01a
- /
- pp.89-92
- /
- 2014
This paper presents a secure and blind adaptive audio watermarking algorithm based on Empirical Mode Decomposition (EMD). The audio signal is divided into frames and each one is decomposed adaptively, by EMD, into several Intrinsic Mode Functions (IMFs). The watermark and the synchronization codes are then embedded into the extrema of the last IMF. The experimental results show that the proposed method has good imperceptibility and robustness against signal processing attacks.
PDF

Search Result 155, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)