Journal of the Korean Institute of Telematics and Electronics (대한전자공학회논문지)
- Volume 25 Issue 9
- /
- Pages.1091-1101
- /
- 1988
- /
- 1016-135X(pISSN)
A Study on the Korean Character Segmentation and Picture Extraction from a Document
한국어 문서로부터 문자분리 및 도형추출에 관한 연구
Abstract
In this paper, a method to segment each character and extract figure from Korean documents is proposed. At first, each character string is extracted by means of iterative horizontal propagation, shrink algorithm and run-length algorithm. Individual character region is extracted by iterative horizontal and vertical manipulation. Next, characters of right pitch are searched. Each character is segmented by the position information. Overlapped character is segmented on the ground of the width of already extracted character. The rest are extracted as special characters of half pitch. Using 9 data input in the form of 840 X 600 from Korean monthly magazine, experiment was simulated. Extraction rate of character is 100%, and that of individual character is 98%. Judging from these results, efficiency on extracting character region and segmenting individual character is proved.
Keywords