Fast mode decision by skipping variable block-based motion estimation and spatial predictive coding in H.264

H.264의 가변 블록 크기 움직임 추정 및 공간 예측 부호화 생략에 의한 고속 모드 결정법

  • Published : 2003.09.01

Abstract

H.264, which is the latest video coding standard of both ITU-T(International Telecommunication Union-Telecommunication standardization sector) and MPEG(Moving Picture Experts Group), adopts new video coding tools such as variable block size motion estimation, multiple reference frames, quarter-pel motion estimation/compensation(ME/MC), 4${\times}$4 Integer DCT(Discrete Cosine Transform), and Rate-Distortion Optimization, etc. These new video coding tools provide good coding of efficiency compared with existing video coding standards as H.263, MPEG-4, etc. However, these new coding tools require the increase of encoder complexity. Therefore, in order to apply H.264 to many real applications, fast algorithms are required for H.264 coding tools. In this paper, when encoder MacroBlock(MB) mode is decided by rate-distortion optimization tool, fast mode decision algorithm by skipping variable block size ME/MC and spatial-predictive coding, which occupies most encoder complexity, is proposed. In terms of computational complexity, the proposed method runs about 4 times as far as JM(Joint Model) 42 encoder of H.264, while the PSNR(peak signal-to-noise ratio)s of the decoded images are maintained.

ITU-T(International Telecommunication Union-Telecommunication standardization sector)와 MPEG(Moving Picture Experts Group)에 의해서 최근 표준화가 완성된 H.264는 가변 블록 크기 움직임 추정, 복수참조영상, 1/4화소 움직임 예측/보상, 4×4 정수 DCT(Integer Discrete Cosine Transform), 율-왜곡 최적화(Rate-Distortion Optimization) 등의 새로운 부호화 기술로 H.263, MPEG-4 등 기존 비디오 표준에 비해 더 좋은 부호화 효율을 제공하고 있다. 그러나 새로운 부호화 기술들은 H.264 의 전반적인 복잡도를 심화시키는 주된 요인이기도 하다. 따라서, H.254 의 실제 응용을 용이하게 하기 위해서는 이러한 기술에 대한 고속 알고리즘이 요구된다. 본 논문에서는 율-왜곡 최적화를 통한 부호화 모드 결정시 부호화기의 복잡도에서 가장 큰 비중을 차지하는 가변 블록 크기 움직임 추정 및 공간예측 부호화를 효율적으로 생략하여 부호화 모드 결정을 빠르게 수행하는 고속 모드 결정법을 제안한다. 실험결과, 제안된 방법은 부호화 효율의 손실이 거의 없으면서도 계산법을 약 4배 향상시킨다.

Keywords

References

  1. ITU Telecom. Standardization Sector, 'Video Codec Test Model Near-Term, Version 10 (TMN10) Draft 1,' H263 Ad Hoc Group, April 1998
  2. 'Information Technolog yCoding of Audio Visual Objects Part2: Visual Amendment 1: Visual Extensions', ISO/IEC JTC1/SC29/WG11N3056, Dec. 1999
  3. Thomas Wiegand, Joint Final Committee Draft(JFCD) of Joint Video specification (ITU-T Rec. H264ISO/IEC 14496-10 AVC),JVT-G050, March, 2003
  4. Antonio Ortega and Kannan Ramchandran, 'Rate-Distortion Methods for Image and Video Compression', IEEE SIGNAL PROCESSING MAGAZINE, pp23-50, November 1998 https://doi.org/10.1109/79.733495
  5. Gary J.Sullivan and Thomas Wiegand, 'Rate-Distortion Optimization for Video Compression', IEEE SIGNAL PROCESSING MAGAZINE, pp 74-90, November 1998 https://doi.org/10.1109/79.733497
  6. P. Topiwala, G. Sullivan, A. Joch, F. Kossentini, 'Overview and Performance Evaluation of the Draft ITU-T H26L Video Coding Standard', Proc. SPIE, Appl. Dig. Im. Proc, Aug.2001
  7. ITU-T SG16 Q6, 'Performance Evaluation of H26L, TML 8 vs H263++ and MPEG-4', VCEG-N18, Sep.2001
  8. M.Zhou, 'Evaluation and Simplification of H.26L Baseline Coding Tools', JVT-B030, Jan. 2002
  9. K. Takagi, 'Lagrange Multiplier and RD-characteristics', JVT-C084, May, 2002
  10. G. Bjontegaard and K. Lillevold 'Contextadaptive VLC(CVLC) coding of coefficients', JVT-C028, May, 2002
  11. James Au, 'Complexity Reduction for CAVLC', JVT-D034, July, 2002
  12. G. Sulivan and G. Bjontegaard, 'Recommended simulation common conditions for H26L coding efficiency experiments on low-resolution progressive-scan source material', ITU-T Q.6/16, Doc.#VCEG-N81, Sep. 2001
  13. http://standard.pictel.com/ftp/videosite/h261/jm42.zip