DOI QR코드

DOI QR Code

A New Stereo Matching Algorithm based on Variable Windows using Frequency Information in DWT Domain

DWT 영역에서의 주파수 정보를 활용한 가변 윈도우 기반의 스테레오 정합 알고리즘

  • Received : 2012.02.28
  • Accepted : 2012.03.23
  • Published : 2012.07.31

Abstract

In this paper we propose a new stereo matching algorithm which is suitable for application to obtain depth information with high-speed in stereoscopic camera environment. For satisfying these condition we propose a new adaptive stereo matching technique using frequency information in discrete wavelet (DWT) domain and variable matching window. The size of the matching window is selected by analysis of the local property of the image in spatial domain and the feature and scaling factor of the matching window is selected by the frequency property in the frequency domain. For using frequency information we use local DWT and global DWT. We identified that the proposed technique has better peak noise to signal ratio (PSNR) than the fixed matching techniques with similar complexity.

본 논문에서는 스테레오 카메라 환경에서 고속으로 깊이 정보를 얻기 위한 응용분야에 적합한 스테레오 정합 기법을 제안하고자 한다. 이러한 조건을 만족하기 위해서 DWT 영역에서의 주파수 정보와 가변 정합창을 이용하는 적응적인 스테레오 정합 기법을 제안한다. 공간 영역에서 영상의 국부적인 특성을 분석하여 정합창의 크기를 결정하고, 주파수 영역에서 영상의 주파수 특성을 분석하여 정합창의 형태 및 스케일링 요소를 결정한다. 주파수 영역에 대한 정보를 이용하기 위해서 로컬 DWT와 전역 DWT를 활용하는 기법을 모두 적용하였다. 본 논문은 스테레오 정합을 위한 제안한 기법은 유사한 수준의 복잡도에서 고정 정합창 기반의 기법과 비교할 때 PSNR이 향상되는 것을 확인하였다.

Keywords

References

  1. ISO/IEC MPEG & ITU-T VCEG, "Multiview video plus depth (MVD) format for advanced 3D video systems," JVT-W100, April 2007.
  2. ISO/IEC, "ISO/IEC JTC1/SC29/WG11 Coding of Moving Picture and Audio," Draft of version 4 of ISO/IEC 14496-10 (E) MPEG05/N7081, April 2005.
  3. D. Scharstein and R. Szeliski, "A taxonomy and evaluation of dense two-frame stereo correspondence algorithms," International Journal of Computer Vision, Vol. 47, Issue 1-3, pp. 7-42, April 2002. https://doi.org/10.1023/A:1014573219977
  4. J. Sun, N. N. Zheng, and H. Y. Shum, "Stereo matching using belief propagation," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 27, Issue 25, pp. 787-800, July 2003.
  5. P. N. Belhumeur, "A Bayesian approach to binocular stereopsis," International Journal of Computer Vision, Vol. 19, Issue 3, pp. 237-260, Aug. 1996. https://doi.org/10.1007/BF00055146
  6. I. Gallo, E. Binaghi, and M. Raspanti, "Neural disparity computation for dense two-frame stereo correspondence," Pattern Recognition Letters, Vol. 29, Issue 5, pp. 673-687, April 2008. https://doi.org/10.1016/j.patrec.2007.12.003
  7. Y. Ohta and T. Kanade, "Stereo by intra- and inter-scanline search using dynamic programming," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-7, March 1985.
  8. C. J. Tsa and A. K. Katsaggelos, "Dense disparity estimation with a divide-and-conquer disparity space image technique," IEEE Transactions on Multimedia, Vol. 1, Issue 1, pp. 18-29, March 1999. https://doi.org/10.1109/6046.748168
  9. C. Georgoulas, L. Kotoulas, G. Ch. Sirakoulis, I. Andreadis, and A. Gasteratos, "Real-time disparity map computation module," Microprocessors and Microsystems, Vol. 32, Issue 3, pp. 159-170, May 2008. https://doi.org/10.1016/j.micpro.2007.10.002
  10. D. I. Han, B. M. Lee, J. I. Cho, and D. H. Hwang, "Real-time object segmentation using disparity map of stereo matching," Applied Mathematics and Computation, Vol. 205, Issue 2, pp. 770-777, Nov. 2008. https://doi.org/10.1016/j.amc.2008.05.110
  11. http://vision.middlebury.edu/stereo/
  12. M. Z. Brown, D. Burschka, and G. D. Hager, "Advances in computational stereo," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, Issue 8, pp. 993-1008 Aug. 2003. https://doi.org/10.1109/TPAMI.2003.1217603
  13. ISO/IEC 11172-2, "Information technology coding of moving picture and associated audio for digital storage media at up to about 1.5Mbps," International Standard, 1993.
  14. ISO/IEC 11172-2, "Information technology coding of moving picture and associated : video," International Standard, 1995.
  15. ISO/IEC 14496-2, "Information technology coding of audio-visual object," International Standard, 2001.
  16. R. C. Gonzales and R. E. Woods, "Digital image processing," Prentice Hall, 2nd edtion, 2001.
  17. I. Daubechies and W. Sweldens, "Factoring wavelet transforms into lifting schemes," J. Fourier Anal. Appl., Vol. 4, pp. 247-269, 1998. https://doi.org/10.1007/BF02476026