The KIPS Transactions:PartB
Korea Information Processing Society
Development of Automatic Accidents Detection Algorithm Using Image Sequence
Lee, Bong-Keun ; Lim, Joong-Seon ; Han, Min-Hong ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 127~134
DOI : 10.3745/KIPSTB.2003.10B.2.127
This paper develops an algorithm for the automatic detection of traffic accidents using image sequences. The algorithm is designed to detect stopped vehicles (traffic accidents, breakdowns, and illegal stops on the road shoulder) within the camera's field of view. Virtual traps are set on accident-prone spots. We analyze the changes in the gray levels of pixels on the virtual traps, which represent the motion of vehicles at the corresponding spots. We verify the proposed algorithm by simulating several situations and checking whether it detects them correctly.
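The virtual-trap idea can be illustrated with a minimal sketch. It is not the authors' algorithm: the decision rule (a trap that differs from the empty-road background but stops changing between frames), the thresholds, and every name below are assumptions made for illustration.

```python
from typing import List, Sequence


def mean_abs_diff(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute gray-level difference between two trap pixel vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def detect_stopped_vehicle(
    trap_frames: List[Sequence[float]],
    background: Sequence[float],
    occupied_thresh: float = 20.0,   # hypothetical threshold
    motion_thresh: float = 5.0,      # hypothetical threshold
    min_still_frames: int = 3,       # hypothetical persistence requirement
) -> bool:
    """Return True if the trap stays occupied and motionless long enough.

    trap_frames holds the gray levels of the trap pixels in consecutive
    frames; background holds the gray levels of the same pixels on an
    empty road (assumed to be known).
    """
    still = 0
    for prev, cur in zip(trap_frames, trap_frames[1:]):
        occupied = mean_abs_diff(cur, background) > occupied_thresh
        moving = mean_abs_diff(cur, prev) > motion_thresh
        # Count consecutive frames that are occupied but not changing.
        still = still + 1 if occupied and not moving else 0
        if still >= min_still_frames:
            return True
    return False
```

A vehicle that merely passes over the trap changes the gray levels for a frame or two and resets the counter, so only an object that remains in place triggers detection.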
3D Object's shape and motion recovery using stereo image and Paraperspective Camera Model
Kim, Sang-Hoon ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 135~142
DOI : 10.3745/KIPSTB.2003.10B.2.135
Robust extraction of a 3D object's features, shape, and global motion information from a 2D image sequence is described. The 21 feature points on a pyramid-type synthetic object are extracted automatically using a color transform technique. The extracted features are used to recover the 3D shape and global motion of the object using a stereo paraperspective camera model and a sequential SVD (Singular Value Decomposition) factorization method. An inherent depth recovery error due to the paraperspective camera model is removed by using stereo image analysis. A 3D synthetic object with 21 features reflecting various positions was designed and tested to show the performance of the proposed algorithm by comparing the recovered shape and motion data with the measured values.
A Fast Error Concealment Using a Data Hiding Technique and a Robust Error Resilience for Video
Kim, Jin-Ok ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 143~150
DOI : 10.3745/KIPSTB.2003.10B.2.143
Error concealment plays an important role in combating transmission errors. Error concealment methods that produce better quality are generally more complex, which makes some of the more sophisticated algorithms unsuitable for real-time applications. In this paper, we develop a temporally and spatially error-resilient video encoding and data hiding approach to facilitate error concealment at the decoder. For spatial error resilience, a block interleaving scheme is introduced to isolate erroneous blocks caused by packet losses. For temporal error resilience, data hiding is applied to the transmission of parity bits that protect motion vectors. To perform error concealment quickly, a set of edge features extracted from a block is embedded imperceptibly into the host media using data hiding and transmitted to the decoder. If some part of the media data is damaged during transmission, the embedded features are used to conceal the lost data at the decoder. This method decreases the complexity of error concealment by reducing the process of estimating lost data from neighboring blocks. The proposed data hiding of parity bits and block features does not add much to the complexity of a standard encoder. Experimental results show that the proposed method properly and effectively conceals burst errors that occur on transmission channels such as the Internet.
Vergence Control of the Parallel-axis Stereo Camera using Signal Processing
Lee, Gwang-Soon ; Kim, Hyoung-Nam ; Hur, Nam-Ho ; Um, Gi-Mun ; Ahn, Chung-Hyun ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 151~156
DOI : 10.3745/KIPSTB.2003.10B.2.151
A vergence control method is presented for a parallel-axis stereo camera (PASC) using signal processing techniques such as shift, (rotation), and scaling. The PASC is considered the simplest of the binocular stereo cameras. However, its major limitation lies in the controllability of vergence, since its left and right CCD imaging sensors are fixed. On the other hand, a horizontal-moving-axis stereo camera (HMASC) with movable imaging sensors can control the vergence by moving its CCDs horizontally. Despite its vergence controllability, it has a major drawback in implementation because of its complicated mechanical structure and additional cost. To overcome the vergence control problem of the PASC, the operational principle of the HMASC is applied to the PASC. Specifically, the vergence control problem of the PASC is solved with signal processing, without any additional hardware. Assuming a virtual displacement between the CCDs, the disappearing part of the acquired images is removed and the original image size is recovered via interpolation. Experimental results show that vergence control between stereo images captured by the PASC is possible with an acceptable degradation of image quality, depending on the virtual displacement of the CCDs.
A Study on Photographic 3D Modeling Techniques for Their Enhancements
Lee, Jong-Soo ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 157~162
DOI : 10.3745/KIPSTB.2003.10B.2.157
In this paper, in-depth considerations are given to various techniques suggested for photographic 3D modelling and possible enhancements of the techniques are discussed. It is found that both the improvement of fundamental matrix estimation techniques and the stereo image rectification processing stage are necessary for more accurate 3D modelling.
Error Recovery by the Classification of Candidate Motion Vectors for H.263 Video Communications
Son, Nam-Rye ; Lee, Guee-Sang ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 163~168
DOI : 10.3745/KIPSTB.2003.10B.2.163
In transmitting a compressed video bit-stream over the Internet, packet loss causes error propagation in both the spatial and temporal domains, which in turn leads to severe degradation in image quality. In this paper, a new approach is proposed for recovering lost or erroneous motion vectors (MVs) by classifying the movements of neighboring blocks according to their homogeneity. The MVs of neighboring blocks are classified according to their direction, and a representative value for each class is determined to obtain the candidate MV set. By computing the distortion of the candidates, the MV with the minimum distortion is selected. Experimental results show that the proposed algorithm exhibits better performance in many cases than existing methods.
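The classify-then-select procedure described above can be sketched as follows. This is only an illustration under stated assumptions: the direction classes (a zero class plus four 90-degree sectors), the component-wise median used as the representative, and the caller-supplied `distortion` function are hypothetical choices, since the abstract does not specify them.

```python
import math
from typing import Callable, Dict, List, Tuple

MV = Tuple[int, int]  # (dx, dy) motion vector


def direction_class(mv: MV) -> str:
    """Assign an MV to a coarse direction class (assumed: 4 sectors + zero)."""
    dx, dy = mv
    if dx == 0 and dy == 0:
        return "zero"
    angle = math.degrees(math.atan2(dy, dx)) % 360
    return "sector%d" % int(angle // 90)


def candidate_set(neighbors: List[MV]) -> List[MV]:
    """Group neighboring MVs by direction and keep one representative each."""
    classes: Dict[str, List[MV]] = {}
    for mv in neighbors:
        classes.setdefault(direction_class(mv), []).append(mv)
    candidates = []
    for members in classes.values():
        xs = sorted(m[0] for m in members)
        ys = sorted(m[1] for m in members)
        # Representative: component-wise median (an assumed choice).
        candidates.append((xs[len(xs) // 2], ys[len(ys) // 2]))
    return candidates


def recover_mv(neighbors: List[MV], distortion: Callable[[MV], float]) -> MV:
    """Pick the candidate MV with the smallest distortion."""
    return min(candidate_set(neighbors), key=distortion)
```

In a decoder, `distortion` would typically measure how well the block fetched with a candidate MV matches the boundary pixels of the correctly received neighbors; here it is left as a parameter so the sketch stays self-contained.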
The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification
Kim, Hack-Jin ; Kim, Soon-Hyub ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 169~178
DOI : 10.3745/KIPSTB.2003.10B.2.169
This study addresses two topics: the classification of phone-like units (PLUs), which is the foundation of Korean large-vocabulary speech recognition, and the effectiveness of Chiljongseong (7 Final Consonants) and Paljongseong (8 Final Consonants) of the Korean language. Phone-like units classify phonemes phonetically according to the place and manner of articulation, and about 50 phone-like units are used in Korean speech recognition. In this study, auditory phonetic knowledge was applied to the classification of phone-like units to present 45 phone-like units. The vowels 'ㅔ, ㅐ' were classified as the phone-like unit (ee); 'ㅒ, ㅖ' as [ye]; and 'ㅚ, ㅙ, ㅞ' as [we]. Secondly, the Chiljongseong system of the Draft for a Unified Spelling System, which is currently in use, and the Paljongseonggajokyong of the Korean script Haerye were illustrated. Whether the phonetic values of 'ㄷ' and 'ㅅ' among the phonemes used as final consonants in the Korean language are the same has long been debated in academia. In this study, the transition stages of Korean consonants were investigated, Chiljongseong and Paljongseonggajokyong were applied to speech recognition, and their effectiveness was verified. The experiment was divided into isolated word recognition and continuous speech recognition. PBW452 was used to test isolated word recognition: about 50 men and women, divided into 5 groups, vocalized 50 words each. For the continuous speech recognition experiment, intended for use in an implemented stock exchange system, a sentence corpus of 71 stock exchange sentences and a speech corpus of those sentences were collected and used; 5 men and 5 women each vocalized a sentence twice.
As a result of the experiment, when Paljongseonggajokyong was used for the final consonants, recognition performance improved by an average of about 1.45%; and when phone-like units with both Paljongseonggajokyong and auditory phonetic knowledge were applied, the recognition rate increased by an average of 1.5% to 2.02%. In the continuous speech recognition experiment, recognition performance was about 1% to 2% higher on average than when the existing 49 or 56 phone-like units were used.
Emotion from Color images and Its Application to Content-based Image Retrievals
Park, Joong-Soo ; Eum, Kyoung-Bae ; Shin, Kyung-Hae ; Lee, Joon-Whoan ; Park, Dong-Sun ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 179~188
DOI : 10.3745/KIPSTB.2003.10B.2.179
In content-based image retrieval, the query is itself an image, and retrieval is the process of seeking images similar to the given query image. In this kind of retrieval, the user has to know the basic physical features of the target images, such as color, texture, shape, and spatial relationship, which is restrictive. In this paper, we propose an emotion-based retrieval system that uses the emotion that color images convey. It differs from past emotion-based image retrieval in that it uses relevance feedback to estimate the user's intent, and it is easily combined with existing content-based image retrieval systems. To test the performance of the proposed system, we use the MPEG-7 color descriptor and emotion words such as "warm", "clean", "bright" and "delight". We tested about 1500 wallpaper images and obtained successful results.
Error Resilient Scheme in Video Data Transmission using Information Hiding
Bae, Chang-Seok ; Choe, Yoon-Sik ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 189~196
DOI : 10.3745/KIPSTB.2003.10B.2.189
This paper describes an error-resilient video data transmission method using information hiding. In order to localize transmission errors at the receiver, the video encoder embeds one bit per macroblock during the encoding process. The embedded information is detected during decoding at the receiver, and transmission errors can be localized by comparing it with the original embedded data. The localized transmission errors can be easily corrected, so the degradation in the reconstructed image can be alleviated. Furthermore, the embedded information can be used to protect the intellectual property rights of the video data. Experimental results for 3 QCIF-sized video sequences of 150 frames each show that, while the degradation in video streams in which the information is embedded is negligible, the average PSNR of reconstructed images, especially in a noisy channel, can be improved by about 5 dB by using the embedded information. Also, intellectual property rights information can be effectively obtained from the reconstructed images.
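The localization step can be illustrated with a small sketch. The abstract does not say where the bit is hidden, so the embedding rule below (the least significant bit of the last quantized coefficient of each macroblock) and all function names are assumptions for illustration only.

```python
from typing import List


def embed_bit(coeffs: List[int], bit: int) -> List[int]:
    """Return a copy of the block whose last coefficient carries `bit`.

    Assumed embedding rule: force the least significant bit of the last
    quantized coefficient to equal the hidden bit.
    """
    out = list(coeffs)
    if (out[-1] & 1) != bit:
        out[-1] ^= 1
    return out


def extract_bit(coeffs: List[int]) -> int:
    """Read the hidden bit back from a received block."""
    return coeffs[-1] & 1


def localize_errors(received: List[List[int]], expected_bits: List[int]) -> List[int]:
    """Indices of macroblocks whose extracted bit disagrees with the expected one."""
    return [
        i
        for i, (block, bit) in enumerate(zip(received, expected_bits))
        if extract_bit(block) != bit
    ]
```

For example, embedding the bits `[1, 0, 1]` into three blocks and then flipping the last coefficient of the second block makes `localize_errors` return `[1]`. With a single bit per block, corruption that happens to leave the hidden bit unchanged goes unnoticed in this sketch.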
A Design and Implementation of the Intelligent Diagnosis System for Diseases associated with Acute Abdominal Pain Based on Fuzzy Relational Products
Hyun, Woo-Seok ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 197~204
DOI : 10.3745/KIPSTB.2003.10B.2.197
Because most conventional medical diagnosis systems focus on small subsets of disease classes for particular human organs, diagnosis is difficult when symptoms are related to many diseases. The author proposes an intelligent diagnosis system for diseases associated with acute abdominal pain based on fuzzy relational products (IDS-DAAP) to complement the conventional system (DS-DAAP). Compared with DS-DAAP, the proposed IDS-DAAP improves the diagnosis rate and reduces diagnosis time.
A Collaboration Model Using Annotations over Shared Documents
Lee, Eun-Jung ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 205~212
DOI : 10.3745/KIPSTB.2003.10B.2.205
ThruBook is a collaborative work platform based on ebook documents that supports synchronized group work sessions. Using this platform, users can participate in a group session based on the same ebook and share group actions synchronously. In this paper, we treat all kinds of additional data on ebooks, including bookmarks, memos, and figures, as annotations. We also model shared actions such as sharing and navigating these annotations. As a result, we can model all collaborative actions as annotations, and we propose a collaborative work model in which annotation objects are communicated between session participants. This approach, implemented in the ThruBook platform, can easily model shared actions in collaborative systems. The collaborative work platform implemented in ThruBook can be applied to other types of collaborative work systems.
Crossword Game Using Speech Technology
Yu, Il-Soo ; Kim, Dong-Ju ; Hong, Kwang-Seok ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 213~218
DOI : 10.3745/KIPSTB.2003.10B.2.213
In this paper, we implement a crossword game that is operated by speech. The CAA (Cross Array Algorithm) produces the crossword array randomly and automatically using a domain dictionary. To produce the crossword array, we construct seven domain dictionaries. The crossword game can be operated by mouse and keyboard as well as by speech. For the speech user interface, we use a speech recognizer and a speech synthesizer, which provide a more comfortable interface to the user. The efficiency of the CAA is evaluated by estimating the processing time for producing the crossword array and the generation ratio of the crossword array. In this evaluation, the processing time is about 10 ms and the generation ratio of the crossword array is about 50%. Also, the recognition rates were 95.5%, 97.6% and 96.2% for the three window sizes tested, respectively.
Rule Construction for Determination of Thematic Roles by Using Large Corpora and Computational Dictionaries
Kang, Sin-Jae ; Park, Jung-Hye ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 219~228
DOI : 10.3745/KIPSTB.2003.10B.2.219
This paper presents an efficient method for constructing rules that determine thematic roles from syntactic relations in Korean language processing. This process is a core part of semantic analysis and an important open issue in natural language processing. It is problematic to write rules for determining thematic roles using only general linguistic knowledge and experience, since the final result may differ according to the subjective views of researchers, and it is impossible to construct rules that cover all cases. In contrast, our method is objective and efficient because it draws on large corpora, which contain practical usages of the Korean language, and on case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more accurately, our system uses syntactic relations, semantic classes, morpheme information, and the position of double subjects. In particular, using semantic classes increases the applicability of the rules.
HMM-based Korean Named Entity Recognition
Hwang, Yi-Gyu ; Yun, Bo-Hyun ;
The KIPS Transactions:PartB, volume 10B, issue 2, 2003, Pages 229~236
DOI : 10.3745/KIPSTB.2003.10B.2.229
Named entity recognition is indispensable to question answering and information extraction systems. This paper presents an HMM-based named entity (NE) recognition method using the construction principles of compound words. In Korean, many named entities can be decomposed into more than one word. Moreover, there are contextual relationships among the nouns in an NE, and between an NE and its surrounding words. In this paper, we classify words into a word that is an NE in itself, a word in an NE, and/or a word adjacent to an NE, and train an HMM based on NE-related word types and parts of speech. The proposed named entity recognition (NER) system uses a trigram HMM to account for the variable length of NEs. However, the trigram HMM has a serious data sparseness problem. To solve this problem, we use multi-level back-offs. Experimental results show that our NER system achieves an F-measure of 87.6% on economic articles.
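How backing off eases trigram sparseness can be shown with a minimal sketch. It does not reproduce the paper's multi-level scheme: it uses a simple fixed-penalty back-off (trigram, then bigram, then unigram relative frequencies), the scores are not normalized probabilities, and the `penalty` value and the state labels in the example are hypothetical.

```python
from collections import Counter
from typing import List, Tuple


def train_counts(sequences: List[List[str]]) -> Tuple[Counter, Counter, Counter]:
    """Count unigrams, bigrams and trigrams of HMM state labels."""
    uni, bi, tri = Counter(), Counter(), Counter()
    for seq in sequences:
        for i, tag in enumerate(seq):
            uni[tag] += 1
            if i >= 1:
                bi[(seq[i - 1], tag)] += 1
            if i >= 2:
                tri[(seq[i - 2], seq[i - 1], tag)] += 1
    return uni, bi, tri


def backoff_score(t1: str, t2: str, t3: str,
                  uni: Counter, bi: Counter, tri: Counter,
                  penalty: float = 0.4) -> float:
    """Score the transition (t1, t2) -> t3, backing off when counts are missing."""
    if tri[(t1, t2, t3)] > 0:
        # Trigram seen: use its relative frequency directly.
        return tri[(t1, t2, t3)] / bi[(t1, t2)]
    if bi[(t2, t3)] > 0:
        # Back off one level to the bigram, with a fixed penalty.
        return penalty * bi[(t2, t3)] / uni[t2]
    total = sum(uni.values())
    # Back off a second level to the unigram.
    return penalty * penalty * uni[t3] / total if total else 0.0
```

With training sequences `[["O", "B", "I", "O"], ["O", "B", "O"]]`, the seen trigram `("O", "B", "I")` is scored from trigram counts, while the unseen `("I", "B", "O")` falls back to the bigram `("B", "O")` instead of receiving a zero score.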