Automatic Correction of Word-spacing Errors using by Syllable Bigram

음절 bigram를 이용한 띄어쓰기 오류의 자동 교정

  • 강승식 (국민대학교 자연과학대학 컴퓨터학부)
  • Published : 2001.06.01

Abstract

We proposed a probabilistic approach of using syllable bigrams to the word-spacing problem. Syllable bigrams are extracted and the frequencies are calculated for the large corpus of 12 million words. Based on the syllable bigrams, we performed three experiments: (1) automatic word-spacing, (2) detection and correction of word-spacing errors for spelling checker, and (3) automatic insertion of a space at the end of line in the character recognition system. Experimental results show that the accuracy ratios are 97.7 percent, 82.1 percent, and 90.5%, respectively.

Keywords