Noise Removal using Support Vector Regression in Noisy Document Images



Kim, Hee-Hoon;Kang, Seung-Hyo;Park, Jai-Hyun;Ha, Hyun-Ho;Lim, Dong-Hoon

  • 투고 : 2012.06.18
  • 심사 : 2012.07.23
  • 발행 : 2012.08.31


Noise removal of document images is a necessary step during preprocessing to recognize characters effectively because it has influences greatly on processing speed and performance for character recognition. We have considered using the spatial filters such as traditional mean filters and Gaussian filters, and wavelet transformed based methods for noise deduction in natural images. However, these methods are not effective for the noise removal of document images. In this paper, we present noise removal of document images using support vector regression. The proposed approach consists of two steps which are SVR training step and SVR test step. We construct an optimal prediction model using grid search with cross-validation in SVR training step, and then apply it to noisy images to remove noises in test step. We evaluate our SVR based method both quantitatively and qualitatively for noise removal in Korean, English and Chinese character documents, and compare it to some existing methods. Experimental results indicate that the proposed method is more effective and can get satisfactory removal results.


Cross-validation;grid search;support vector regression;noise removal


  1. Chang, C. C. and Lin, C. J. (2011). LIBSVM: A library for support vector machines, URL:
  2. Cherkassky, V. and Mulier, F. (1998). Learning from Data: Concepts Theory and Methods, Wiley, New York.
  3. Don, H. S. (2001). A noise attribute thresholding method for document image binarization, International Journal on Document Analysis and Recognition, 4, 131-138.
  4. Donoho, D. L. (1995). De-noising by soft thresholding, IEEE Transactions on Information Theory, 41, 613-627.
  5. Donoho, D. L. and Johnstone, I. M. (1994). Spatial adaptation by wavelet shrinkage, Biometrika, 81, 425-455.
  6. Gonzales, R. C. and Woods, R. E. (1992). Digital Image Processing, Addison-Wesley Publishing Company.
  7. Li, A., Mersereau, R. and Simske, S. (2007). Blind image deconvolution through support vector regression, IEEE Transactions on Neural Networks, 931-935.
  8. Li, D. (2009). Support vector regression based image denoising, Image and Vision Computing, 27, 623-627.
  9. Lin, T. C. and Yu, P. T. (2004). Adaptive two-pass median filter based on support vector machines for image restoration, Neural Computation, 16, 192-206.
  10. Liu, H., Sun, F. and Sun, Z. (2006). Image filtering using support vector machine, Lecture Notes in Computer Science, 3972, 533-538.
  11. Vapnik, V. (1995). The Nature of Statistical Learning Theory, Springer-Verlag.
  12. Vetterli, M., Chang, S. G. and Yu, B. (2000). Adaptive wavelet thresholding for image denoising and compression, IEEE Transactions on Image Processing, 9, 1532-1546.
  13. Wang, X. Y. and Fu, Z. K. (2010). A wavelet-based image denoising using least squares support vector machine, Engineering Applications of Artificial Intelligence, 862-871.
  14. Wang, X. Y., Yang, H. Y. and Fu, Z. K. (2010). A new wavelet-based image denoising using undecimated discrete wavelet transform and least squares support vector machine, Expect Systems with Applications, 37, 7040-7049.


연구 과제 주관 기관 : National Research Foundation of Korea(NRF)