Search | Korea Science

Optimization of Gaussian Mixture in CDHMM Training for Improved Speech Recognition

Lee, Seo-Gu;Kim, Sung-Gil;Kang, Sun-Mee;Ko, Han-Seok
- Speech Sciences
- /
- v.5 no.1
- /
- pp.7-21
- /
- 1999
This paper proposes an improved training procedure in speech recognition based on the continuous density of the Hidden Markov Model (CDHMM). Of the three parameters (initial state distribution probability, state transition probability, output probability density function (p.d.f.) of state) governing the CDHMM model, we focus on the third parameter and propose an efficient algorithm that determines the p.d.f. of each state. It is known that the resulting CDHMM model converges to a local maximum point of parameter estimation via the iterative Expectation Maximization procedure. Specifically, we propose two independent algorithms that can be embedded in the segmental K -means training procedure by replacing relevant key steps; the adaptation of the number of mixture Gaussian p.d.f. and the initialization using the CDHMM parameters previously estimated. The proposed adaptation algorithm searches for the optimal number of mixture Gaussian humps to ensure that the p.d.f. is consistently re-estimated, enabling the model to converge toward the global maximum point. By applying an appropriate threshold value, which measures the amount of collective changes of weighted variances, the optimized number of mixture Gaussian branch is determined. The initialization algorithm essentially exploits the CDHMM parameters previously estimated and uses them as the basis for the current initial segmentation subroutine. It captures the trend of previous training history whereas the uniform segmentation decimates it. The recognition performance of the proposed adaptation procedures along with the suggested initialization is verified to be always better than that of existing training procedure using fixed number of mixture Gaussian p.d.f.
PDF

A Hardware Implementation of Moving Object Detection Algorithm using Gaussian Mixture Model (가우시안 혼합 모델을 이용한 이동 객체 검출 알고리듬의 하드웨어 구현)

Kim, Gyeong-hun;An, Hyo-Sik;Shin, Kyung-wook
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2015.05a
- /
- pp.407-409
- /
- 2015
In this paper, a hardware implementation of MOD(Moving Object Detection) algorithm is described, which is based GMM(Gaussian Mixture Model) and background subtraction. The EGML(Effective Gaussian Mixture Learning) is used to model and update background. Some approximations of EGML calculations are applied to reduce hardware complexity, and pipelining technique is used to improve operating speed. Gaussian parameters are adjustable according to various environment conditions to achieve better MOD performance. MOD processor is verified by using FPGA-in-the-loop verification, and it can operate with 109 MHz clock frequency on XC5VSX95T FPGA device.
PDF

Gaussian Mixture based K2 Rifle Chamber Pressure Modeling of M193 and K100 Bullets (가우시안 혼합모델 기반 탄종별 K2 소화기의 약실압력 모델링)

Kim, Jong-Hwan;Lee, Byounghwak;Kim, Kyoungmin;Shin, Kyuyong;Lee, Wonwoo
- Journal of the Korea Institute of Military Science and Technology
- /
- v.22 no.1
- /
- pp.27-34
- /
- 2019
This paper presents a chamber pressure model development of K2 rifle by applying Gaussian mixture model. In order to materialize a real recoil force of a virtual reality shooting rifle in military combat training, the chamber pressure which is one of major components of the recoil force needs to be investigated and modeled. Over 200,000 data of the chamber pressure were collected by implementing live fire experiments with both K100 and M193 of 5.56 mm bullets. Gaussian mixture method was also applied to create a mathematical model that satisfies nonlinear, asymmetry, and deviations of the chamber pressure which is caused by irregular characteristics of propellant combustion. In addition, Polynomial and Fourier Regression were used for comparison of results, and the sum of squared errors, the coefficient of determination and root-mean-square errors were analyzed for performance measurement.
https://doi.org/10.9766/KIMST.2019.22.1.027 인용 PDF KSCI HTML

Discrimination of Speech and Music signals using the Gaussian Mixture Model of Feature Parameters (추출된 특성 파라메터의 Gaussian Mixture Model 을 이용한 음성 및 음악 판별)

Jung Ki-Hoon;Lee Bong-Jin;Kim Kyung-Tae;Kang Hong-Goo
- Proceedings of the Korean Institute of Communication Sciences Conference
- /
- 2006.07a
- /
- pp.434-434
- /
- 2006
PDF

IMAGE DENOISING BASED ON MIXTURE DISTRIBUTIONS IN WAVELET DOMAIN

Bae, Byoung-Suk;Lee, Jong-In;Kang, Moon-Gi
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2009.01a
- /
- pp.246-249
- /
- 2009
Due to the additive white Gaussian noise (AWGN), images are often corrupted. In recent days, Bayesian estimation techniques to recover noisy images in the wavelet domain have been studied. The probability density function (PDF) of an image in wavelet domain can be described using highly-sharp head and long-tailed shapes. If a priori probability density function having the above properties would be applied well adaptively, better results could be obtained. There were some frequently proposed PDFs such as Gaussian, Laplace distributions, and so on. These functions model the wavelet coefficients satisfactorily and have its own of characteristics. In this paper, mixture distributions of Gaussian and Laplace distribution are proposed, which attempt to corporate these distributions' merits. Such mixture model will be used to remove the noise in images by adopting Maximum a Posteriori (MAP) estimation method. With respect to visual quality, numerical performance and computational complexity, the proposed technique gained better results.
PDF

Analysis and Implementation of Speech/Music Classification for 3GPP2 SMV Based on GMM (3GPP2 SMV의 실시간 음성/음악 분류 성능 향상을 위한 Gaussian Mixture Model의 적용)

Song, Ji-Hyun;Lee, Kye-Hwan;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.8
- /
- pp.390-396
- /
- 2007
In this letter, we propose a novel approach to improve the performance of speech/music classification for the selectable mode vocoder(SMV) of 3GPP2 using the Gaussian mixture model(GMM) which is based on the expectation-maximization(EM) algorithm. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then feature vectors which are applied to the GMM are selected from relevant Parameters of the SMV for the efficient speech/music classification. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.
PDF KSCI

A Gaussian Mixture Model Based Pattern Classification Algorithm of Forearm Electromyogram (Gaussian Mixture Model 기반 전완 근전도 패턴 분류 알고리즘)

Song, Y.R.;Kim, S.J.;Jeong, E.C.;Lee, S.M.
- Journal of rehabilitation welfare engineering & assistive technology
- /
- v.5 no.1
- /
- pp.95-101
- /
- 2011
In this paper, we propose the gaussian mixture model based pattern classification algorithm of forearm electromyogram. We define the motion of 1-degree of freedom as holding and unfolding hand considering a daily life for patient with prosthetic hand. For the extraction of precise features from the EMG signals, we use the difference absolute mean value(DAMV) and the mean absolute value(MAV) to consider amplitude characteristic of EMG signals. We also propose the D_DAMV and D_MAV in order to classify the amplitude characteristic of EMG signals more precisely. In this paper, we implemented a test targeting four adult male and identified the accuracy of EMG pattern classification of two motions which are holding and unfolding hand.
PDF KSCI

A Neuro-Fuzzy Modeling using the Hierarchical Clustering and Gaussian Mixture Model (계층적 클러스터링과 Gaussian Mixture Model을 이용한 뉴로-퍼지 모델링)

Kim, Sung-Suk;Kwak, Keun-Chang;Ryu, Jeong-Woong;Chun, Myung-Geun
- Journal of the Korean Institute of Intelligent Systems
- /
- v.13 no.5
- /
- pp.512-519
- /
- 2003
In this paper, we propose a neuro-fuzzy modeling to improve the performance using the hierarchical clustering and Gaussian Mixture Model(GMM). The hierarchical clustering algorithm has a property of producing unique parameters for the given data because it does not use the object function to perform the clustering. After optimizing the obtained parameters using the GMM, we apply them as initial parameters for Adaptive Network-based Fuzzy Inference System. Here, the number of fuzzy rules becomes to the cluster numbers. From this, we can improve the performance index and reduce the number of rules simultaneously. The proposed method is verified by applying to a neuro-fuzzy modeling for Box-Jenkins s gas furnace data and Sugeno's nonlinear system, which yields better results than previous oiles.
https://doi.org/10.5391/JKIIS.2003.13.5.512 인용 PDF KSCI

A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique

Cho, Hoon-Young;Kim, Sang-Hun
- ETRI Journal
- /
- v.32 no.5
- /
- pp.795-800
- /
- 2010
Embedding a large vocabulary speech recognition system in mobile devices requires a reduced acoustic model obtained by eliminating redundant model parameters. In conventional optimization methods based on the minimum description length (MDL) criterion, a binary Gaussian tree is built at each state of a hidden Markov model by iteratively finding and merging similar mixture components. An optimal subset of the tree nodes is then selected to generate a downsized acoustic model. To obtain a better binary Gaussian tree by improving the process of finding the most similar Gaussian components, this paper proposes a new distance measure that exploits the difference in likelihood values for cases before and after two components are combined. The mixture weight of Gaussian components is also introduced in the component merging step. Experimental results show that the proposed method outperforms MDL-based optimization using either a Kullback-Leibler (KL) divergence or weighted KL divergence measure. The proposed method could also reduce the acoustic model size by 50% with less than a 1.5% increase in error rate compared to a baseline system.
https://doi.org/10.4218/etrij.10.1510.0062 인용 PDF KSCI

Feature Selection for Multi-Class Genre Classification using Gaussian Mixture Model (Gaussian Mixture Model을 이용한 다중 범주 분류를 위한 특징벡터 선택 알고리즘)

Moon, Sun-Kuk;Choi, Tack-Sung;Park, Young-Cheol;Youn, Dae-Hee
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.32 no.10C
- /
- pp.965-974
- /
- 2007
In this paper, we proposed the feature selection algorithm for multi-class genre classification. In our proposed algorithm, we developed GMM separation score based on Gaussian mixture model for measuring separability between two genres. Additionally, we improved feature subset selection algorithm based on sequential forward selection for multi-class genre classification. Instead of setting criterion as entire genre separability measures, we set criterion as worst genre separability measure for each sequential selection step. In order to assess the performance proposed algorithm, we extracted various features which represent characteristics such as timbre, rhythm, pitch and so on. Then, we investigate classification performance by GMM classifier and k-NN classifier for selected features using conventional algorithm and proposed algorithm. Proposed algorithm showed improved performance in classification accuracy up to 10 percent for classification experiments of low dimension feature vector especially.
PDF KSCI

Search Result 414, Processing Time 0.036 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)