Example-based Super Resolution Text Image Reconstruction Using Image Observation Model

Park, Gyu-Ro; Kim, In-Jung

doi:10.3745/KIPSTB.2010.17B.4.295

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 17B, Issue 4, pp. 295-302, 2010, pISSN 1598-284X

Korea Information Processing Society (한국정보처리학회)

영상 관찰 모델을 이용한 예제기반 초해상도 텍스트 영상 복원

박규로 (한동대학교 정보통신공학과) ;
김인중 (한동대학교)

Received : 2010.06.04
Accepted : 2010.07.21
Published : 2010.08.31

https://doi.org/10.3745/KIPSTB.2010.17B.4.295

Abstract

Example-based super resolution (EBSR) reconstructs high-resolution images by learning the patch-wise correspondence between high-resolution and low-resolution images. It can reconstruct a high-resolution image from a single low-resolution image. However, when it is applied to a text image whose font type and size differ from those of the training images, it often produces considerable noise. The primary reason is that, in the patch matching step of the reconstruction process, input patches can be matched to inappropriate high-resolution patches in the patch dictionary. In this paper, we propose a new patch matching method to overcome this problem. Using an image observation model, the method preserves the correlation between the input and output images, and thereby effectively suppresses the spurious noise caused by inappropriately matched patches. This not only improves the quality of the output image but also allows the system to use a very large dictionary containing a variety of font types and sizes, which significantly improves adaptability to variation in font type and size. In experiments, the proposed method outperformed conventional methods in reconstructing multi-font and multi-size images. Moreover, it improved recognition performance from 88.58% to 93.54%, which confirms that the proposed method is also effective in practice for recognition.

예제기반 초해상도 영상 복원(EBSR)은 고해상도 영상과 저해상도 영상간의 패치간 대응관계를 학습함으로써 고해상도 영상을 복원하는 방법으로, 한 장의 저해상도 영상으로부터도 고해상도 영상을 복원할 수 있는 장점이 있다. 그러나, 폰트의 종류나 크기가 학습 영상과 다른 텍스트 영상을 적용할 경우 잡영을 많이 발생시킨다. 그 이유는 복원 과정 중 매칭 단계에서 입력 패치들이 사전 내의 고해상도 패치와 부적절하게 매칭될 수 있기 때문이다. 본 논문에서는 이러한 문제점을 극복하기 위한 새로운 패치 매칭 방법을 제안한다. 제안하는 방법은 영상 관찰 모델을 이용하여 입력 영상과 출력 영상간의 상관 관계를 보존함으로써 잘못 매칭된 패치로 인한 잡영을 효과적으로 억제한다. 이는 출력 영상의 화질을 개선할 뿐 아니라, 다양한 종류 및 크기의 폰트를 포함한 대용량 패치 사전을 적용할 수 있게 함으로써 폰트의 종류 및 크기의 변이에 대한 적응력을 크게 향상시킨다. 실험에서 제안하는 방법은 폰트와 크기가 다양한 영상에 대하여 기존의 방법보다 우수한 영상 복원 성능을 나타내었다. 뿐만 아니라, 인식 성능도 88.58%에서 93.54%로 개선되어 제안하는 방법이 인식 성능의 개선에도 효과적임을 확인하였다.
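
The abstract does not specify the matching procedure in detail, so the following is only a minimal Python sketch of the general idea, written under several assumptions that are not taken from the paper: the observation model is taken to be block averaging followed by decimation (`observe`), dictionary lookup uses a mean-removed low-resolution patch as its feature (`feature`), and candidates are re-ranked by how well their degraded high-resolution patch reproduces the input (`match_patch`). All function names and the parameter `k` are hypothetical.

```python
import numpy as np

def observe(hr_patch, scale):
    """Assumed observation model: average each scale x scale block, then decimate."""
    h, w = hr_patch.shape
    return hr_patch.reshape(h // scale, scale, w // scale, scale).mean(axis=(1, 3))

def feature(lr_patch):
    """Assumed matching feature: the low-resolution patch with its mean removed."""
    return lr_patch - lr_patch.mean()

def match_patch(lr_patch, dict_hr, scale, k=2):
    """Return the index of the dictionary HR patch chosen for lr_patch.

    Step 1: shortlist the k entries whose features are nearest to the input
            (feature-only matching, as in a conventional dictionary lookup).
    Step 2: re-rank the shortlist by observation error, i.e. how closely the
            degraded HR patch reproduces the input patch itself.
    """
    dict_lr = np.stack([observe(p, scale) for p in dict_hr])
    feats = np.stack([feature(p) for p in dict_lr])
    feat_dist = ((feats - feature(lr_patch)) ** 2).sum(axis=(1, 2))
    shortlist = np.argsort(feat_dist, kind="stable")[:k]
    obs_err = ((dict_lr[shortlist] - lr_patch) ** 2).sum(axis=(1, 2))
    return int(shortlist[np.argmin(obs_err)])

# Toy dictionary of 4x4 HR patches: a brightened edge, the true edge, an inverted edge.
base = np.array([[0, 0, 1, 1]] * 4, dtype=float)
dict_hr = np.stack([base + 0.5, base, 1.0 - base])

lr_in = observe(base, 2)                       # low-resolution input patch
best = match_patch(lr_in, dict_hr, scale=2)    # observation check selects entry 1
baseline = match_patch(lr_in, dict_hr, scale=2, k=1)  # feature-only match selects entry 0
```

In this toy case the first two dictionary entries have identical features, so feature-only matching cannot tell them apart, while the observation check rejects the entry whose degraded version is inconsistent with the input. This illustrates the kind of mismatch the abstract attributes to inappropriately matched patches; the paper's actual cost function may differ.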

Cited by

Super Resolution Algorithm Based on Edge Map Interpolation and Improved Fast Back Projection Method in Mobile Devices vol.1, pp.2, 2012, https://doi.org/10.3745/KTSDE.2012.1.2.103
