DOI QR코드

DOI QR Code

깊이 얼굴 영상 부호화에서의 양자화 인자 결정 방법

Quantization Parameter Determination Method for Face Depth Image Encoding

  • 박동진 (동의대학교 컴퓨터소프트웨어공학과) ;
  • 권순각 (동의대학교 컴퓨터소프트웨어공학과)
  • 투고 : 2020.01.10
  • 심사 : 2020.02.02
  • 발행 : 2020.02.29

초록

본 논문에서는 얼굴 인식 정확도에 미치는 영향을 최소화하면서 효율적으로 깊이 얼굴 영상을 압축하기 위한 양자화 변수 결정 방법을 제안한다. H.264/AVC의 양자화를 적용하여 깊이 얼굴 영상을 압축 할 때 얼굴 특징을 최대한 유지할 수 있도록 타원체 모델링의 예측 정확도와 각각의 양자화 단위 블록의 얼굴 인식에서의 중요도를 이용하여 양자화 인자를 차등적으로 부여한다. 모의실험 결과 제안된 방법을 통해 같은 압축율에서 얼굴 인식 성공률이 최대 6% 개선되었다.

In this paper, we propose a quantization parameter determination method for face depth image encoding in order to minimize an impact on a face recognition accuracy. When a face depth image is compressed through quantization in H.264/AVC, differential quantization parameters are assigned according to an accuracy of ellipsoid modeling prediction and an importance degree of a unit block in extracting facial features. The simulation results show that the face recognition success rates are improved by up to 6% at the same compression rates through the proposed compression rate determination method.

키워드

참고문헌

  1. Kwon, S. K. (2019). Face Recognition Using Depth and Infrared Pictures, Nonlinear Theory and Its Applications, IEICE, 10(1), 2-15. https://doi.org/10.1587/nolta.10.2.
  2. Wu, H., Sun, X., Yang, J., and Wu, F. (2019). 3D Mesh Based Inter-Image Prediction for Image Set Compression, Proceeding of the IEEE International Conference on Multimedia and Expo, July 8-12, Shanghai, China.
  3. Schnabel, R., and Klein, R. (2006). Octree-based Point-cloud Compression. Proceeding of the Eurographics Symposium on Point-Based Graphics, July 29-30, Boston, Massachusetts, USA.
  4. Gumhold, S., Karni, Z., Isenburg, M., and Seidel, H. (2005) Predictive Pointcloud Compression, Proceeding of ACM SIGGRAPH, Jul. 31-Aug. 4, Los Angeles, California, USA.
  5. Morvan, Y., Farin, D., and deWith, P.H.N. (2007). Depth-image Compression Based on An R-D Optimized Quadtree Decomposition for The Transmission of Multiview Images, Proceeding of the IEEE International Conference on Image Processing, Sep. 16-Oct. 19, San Antonio, Texas, USA.
  6. Milani, S., and Calvagno, G. (2010). A Depth Image Coder Based on Progressive Silhouettes, IEEE Signal Process. Letters, 17(8), 711-714. https://doi.org/10.1109/LSP.2010.2051619.
  7. Shen, G., Kim, W., Narang, S., Orterga, A., Lee, J., and Wey, H. (2010). Edge Adaptive Transform for Efficient Depth Map Coding, Proceeding of Picture Coding Symposium, Dec. 8-10, Nagoya, Japan.
  8. Maitre, M., and Do, M. (2010). Depth and Depth-Color Coding Using Shape Adaptive Wavelets, Journal of Visual Communication and Image Representation, 21(5-6), 513-522. https://doi.org/10.1016/j.jvcir.2010.03.005.
  9. Fu, J., Miao, D., Yu, W., Wang, S., Lu, Y., and Li, S. (2013). Kinect-Like Depth Data Compression, IEEE Transactions on Multimedia, 15(6), 1340-1352. https://doi.org/10.1109/TMM.2013.2247584
  10. Wang, X., Sekercioglu, Y. A., Drummond, T., Natalizio, E., Fantoni, I., and Fremont, V. (2016). Fast Depth Video Compression for Mobile RGB-D Sensors, IEEE Transactions on Circuits and Systems for Video Technology, 26(4), 673-686, https://doi.org/10.1109/TCSVT.2015.2416571
  11. Mamou, K., Zaharia, T., and Preteux, F. (2008). FAMC: The MPEG-4 Standard for Animated Mesh Compression, Proceeding of the IEEE International Conference on Image Processing, Oct. 12-15, San Diego, California, USA.
  12. Paysan, P., Knothe, R., Amberg, B., Romdhani, S., and Vetter, T. (2009). A 3D Face Model for Pose and Illumination Invariant Face Recognition, Proceeding of IEEE International Conference on Advanced Video and Signal Based Surveillance, Sep. 2-4, Genova, Italy.
  13. Lee, D. S. and Kwon, S. K. (2018). Intra Prediction of Depth Picture with Plane Modeling, Symmetry, 10, 1-16. https://doi.org/10.3390/sym10120715
  14. Takagi, K., Takishima, Y., and Nakajima, Y. (2003). A Study on Rate Distortion Optimization Scheme for JVT Coder, Proceeding of SPIE, Apr. 21, Orlando, Florida, USA.
  15. Ma, S., Wen, G., and Yan, L. (2005). Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control, IEEE Transactions on Circuits and Systems for Video Technology, 15(12), 1533-1544. https://doi.org/10.1109/TCSVT.2005.857300
  16. Hg, R. I., Jasek, P., Rofidal, C., Nasrollahi, K., and Moeslund, T. B. (2012). An RGB-D Database using Microsoft's Kinect for Windows for Face Detection, Proceeding of the IEEE 8th International Conference on Signal Image Technology & Internet Based Systems, Nov. 25-29, Sorrento, Naples, Italy.
  17. Lee, D. S., Han D. H., and Kwon, S. K. (2018). Face Recognition Method by Using Infrared and Depth Images, Journal of the Korea Industrial Information Systems Research, 23(2), 1-9. http://dx.doi.org/10.9723/jksiis.2018.23.2.001.