Search | Korea Science

Lee, Sang-Min;Yeon, Jun-Sang;Kim, Ji-Soo;Kim, Sung-Soo
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.61 no.9
- /
- pp.1336-1339
- /
- 2012
In this paper, we focus on solving the classification problem by using semisupervised learning strategy. Traditional classifiers are constructed based on labeled data in supervised learning. Labeled data, however, are often difficult, expensive or time consuming to obtain, as they require the efforts of experienced human annotators. Unlabeled data are significantly easier to obtain without human efforts. Thus, we use AdaBoost algorithm with SVM-KNN classifier to apply semisupervised learning problem and improve the classifier performance. Experimental results on both artificial and UCI data sets show that the proposed methodology can reduce the error rate.
https://doi.org/10.5370/KIEE.2012.61.9.1336 인용 PDF KSCI

Seok, Kyungha
- Journal of the Korean Data and Information Science Society
- /
- v.26 no.2
- /
- pp.517-524
- /
- 2015
Unlabeled examples are easier and less expensive to be obtained than labeled examples. In this paper semisupervised approach is used to utilize such examples in an effort to enhance the predictive performance of nonlinear quantile regression problems. We propose a semisupervised quantile regression method named semisupervised support vector quantile regression, which is based on support vector machine. A generalized approximate cross validation method is used to choose the hyper-parameters that affect the performance of estimator. The experimental results confirm the successful performance of the proposed S2SVQR.
https://doi.org/10.7465/jkdi.2015.26.2.517 인용 PDF KSCI

Shim, Jooyong;Seok, Kyungha
- Journal of the Korean Data and Information Science Society
- /
- v.25 no.2
- /
- pp.455-464
- /
- 2014
Unlabeled examples are easier and less expensive to obtain than labeled examples. Semisupervised approaches are used to utilize such examples in an eort to boost the predictive performance. This paper proposes a novel semisupervised classication method named transductive least squares support vector machine (TLS-SVM), which is based on the least squares support vector machine. The proposed method utilizes the dierence convex algorithm to derive nonconvex minimization solutions for the TLS-SVM. A generalized cross validation method is also developed to choose the hyperparameters that aect the performance of the TLS-SVM. The experimental results conrm the successful performance of the proposed TLS-SVM.
https://doi.org/10.7465/jkdi.2014.25.2.455 인용 PDF KSCI

Vinay Padimi;Venkata Sravan Telu;Devarani Devi Ningombam
- ETRI Journal
- /
- v.45 no.6
- /
- pp.1007-1021
- /
- 2023
Stroke is the leading cause of permanent disability in adults, and it can cause permanent brain damage. According to the World Health Organization, 795 000 Americans experience a new or recurrent stroke each year. Early detection of medical disorders, for example, strokes, can minimize the disabling effects. Thus, in this paper, we consider various risk factors that contribute to the occurrence of stoke and machine learning algorithms, for example, the decision tree, random forest, and naive Bayes algorithms, on patient characteristics survey data to achieve high prediction accuracy. We also consider the semisupervised self-training technique to predict the risk of stroke. We then consider the near-miss undersampling technique, which can select only instances in larger classes with the smaller class instances. Experimental results demonstrate that the proposed method obtains an accuracy of approximately 98.83% at low cost, which is significantly higher and more reliable compared with the compared techniques.
https://doi.org/10.4218/etrij.2022-0271 인용 PDF

Byung Ok Kang;Hyung-Bae Jeon;Yun Kyung Lee
- ETRI Journal
- /
- v.46 no.1
- /
- pp.48-58
- /
- 2024
This paper presents the development of language tutoring systems for nonnative speakers by leveraging advanced end-to-end automatic speech recognition (ASR) and proficiency evaluation. Given the frequent errors in non-native speech, high-performance spontaneous speech recognition must be applied. Our systems accurately evaluate pronunciation and speaking fluency and provide feedback on errors by relying on precise transcriptions. End-to-end ASR is implemented and enhanced by using diverse non-native speaker speech data for model training. For performance enhancement, we combine semisupervised and transfer learning techniques using labeled and unlabeled speech data. Automatic proficiency evaluation is performed by a model trained to maximize the statistical correlation between the fluency score manually determined by a human expert and a calculated fluency score. We developed an English tutoring system for Korean elementary students called EBS AI Peng-Talk and a Korean tutoring system for foreigners called KSI Korean AI Tutor. Both systems were deployed by South Korean government agencies.
https://doi.org/10.4218/etrij.2023-0322 인용 PDF

Kim, Hyun-Jung
- Journal of the Korean BIBLIA Society for library and Information Science
- /
- v.23 no.3
- /
- pp.5-17
- /
- 2012
In citation analysis, author names are often used as the unit of analysis and some authors are indexed under the same name in bibliographic databases where the citation counts are obtained from. There are many techniques for author name disambiguation, using supervised, unsupervised, or semisupervised learning algorithms. Unsupervised approach uses machine learning algorithms to extract necessary bibliographic information from large-scale databases and digital libraries, while supervised approaches use manually built training datasets for clustering author groups for combining them with learning algorithms for author name disambiguation. The study examines various techniques for author name disambiguation in the hope for finding an aid to improve the precision of citation counts in citation analysis, as well as for better results in information retrieval.
https://doi.org/10.14699/kbiblia.2012.23.3.005 인용 PDF KSCI