Reconstruction of Overlapping Character in Thai Printed Documents

  • Nucharee Pemchaiswa (Faculty of Information Technology, King Mogkut's Institute of Technology Ladkrabang, Bangkok) ;
  • Wichian Premchaiswadi (Faculty of Information Technology, King Mogkut's Institute of Technology Ladkrabang, Bangkok) ;
  • Voravit Premratanachai (Faculty of Information Technology, King Mogkut's Institute of Technology Ladkrabang, Bangkok) ;
  • Seinosuke Narita (Department of Electrical, Electronics and Computer Engineering, School of Science and Engineering, Waseda University)
  • 발행 : 2000.07.01

초록

This paper proposes a reconstruction scheme for overlapping characters in Thai printed document. Overlapping characters are characters that overlap with surrounding characters. The problem of overlapping characters is still an unsolved problem In commercially available software of Thai character recognition systems. The algorithm of reconstruction scheme is based on structural analysis of overlapping Thai printed characters. It consists of 2 steps: overlapping point determination and reconstruction of segmented characters. The overlapping point is defined as the intersection point between characters and can be determined by using templates. Then, an overlapping character is separated into segments at the intersection point. The structure of each segment may be an incomplete character and is not identical to the original one. Therefore, the reconstruction process is employed to add the incomplete part of these segments. The proposed scheme has been implemented and tested with 70 patterns of conventionally found in overlapping printed Thai characters with different typefaces and type sizes. The experimental results show that the proposed scheme can segment and reconstruct overlapping characters correctly. The proposed scheme can improve the recognition rate of commercially available software, ThaiOCR1.5 and ArnThai1.0, more than 60 percents

키워드