Representative Feature Extraction of Objects using VQ and Its Application to Content-based Image Retrieval

VQ를 이용한 영상의 객체 특징 추출과 이를 이용한 내용 기반 영상 검색

  • 장동식 (고려대학교 산업시스템정보공학과) ;
  • 정세환 (LG 기술원 Innovation Center Digital Vision 그룹 연구원) ;
  • 유헌우 (고려대학교 산업시스템정보공학과) ;
  • 손용준 (고려대학교 산업시스템정보공학과)
  • Published : 2001.12.01

Abstract

In this paper, a new method of feature extraction of major objects to represent an image using Vector Quantization(VQ) is proposed. The principal features of the image, which are used in a content-based image retrieval system, are color, texture, shape and spatial positions of objects. The representative color and texture features are extracted from the given image using VQ(Vector Quantization) clustering algorithm with a general feature extraction method of color and texture. Since these are used for content-based image retrieval and searched by objects, it is possible to search and retrieve some desirable images regardless of the position, rotation and size of objects. The experimental results show that the representative feature extraction time is much reduced by using VQ, and the highest retrieval rate is given as the weighted values of color and texture are set to 0.5 and 0.5, respectively, and the proposed method provides up to 90% precision and recall rate for 'person'query images.

내용 기반 영상 검색을 위해 본 연구에서는 VQ(Vector Quantization)을 이용하여 영상을 구성하는 주요 객체들의 특징 추출 방법을 제안한다. 내용 기반 영상 검색 시스템에서 사용되는 영상의 주요특징으로는 색상, 절감, 형태 및 영상을 구성하고 있는 객체들의 공간적 위치 등이 있다. 이 중 본 논문에서는 일반적인 색상 및 질감 특징 추출방법과 더불어 VQ 멕터 클러스터링 알고리즘을 이용하여 정지영상을 구성하고 있는 객체들의 대표 색상과 질감 특징을 빠르게 추출하고 이를 내용 기반 검색에 이용함으로써 정지영상의 내용에 근거한 검색을 하였고 객체 단위 검색을 함으로써 객체의 위치, 회전 및 크기 변화에 무관한 검색을 가능케 했다. 연구의 실험 결과 VQ를 이용함으로써 대표특징치 추출시간을 줄일수 있었고 검색시 색상과 질감 특징의 가중치를 각각 0.5, 0.5로 주는 것이 가장 높은 검출율을 보였으며, ‘사람’영상에 제한한 방법을 적용한 경우 90%의 검출율을 보였다.

Keywords

References

  1. W. Niblack, R Barber, W. Equitz, M. Flickner. E. Glasman, D. Potkovic, C. Faloutsos, and G. Taubin, 'The QBIC project : Querying images by content using color, texture, and shape,' In Proc. SPIE vol.1908 : Storage and Retrieval for Image and Video Databases, pp.173-181, February 1993 https://doi.org/10.1117/12.143648
  2. A. Hampapur, A. Gupta, B. Horowitz, C. Fuller, J.R Bach, M. Gorkani, R C. Jain, Virago Inc., 'Virage Video Engine,' In Proc. SPIE Vol.3022:Storage and Retrieval for Image and Video Databases, pp.188-198, February 1997 https://doi.org/10.1117/12.263407
  3. J. R. Smith and S. F. Chang, 'VisualSEEK : A Fully Automated Content- Based Image Query System,' in Proc. ACM Int'l. Conf. Multimedia, pp.87-98, 1996 https://doi.org/10.1145/244130.244151
  4. M.J. Swain, C. Frankel, Vassilis Athitsas, ' Webseer An Image Search Engine for the World Wide Web,' Chicago Univ. Technical Report 96-114
  5. Tom M. Mitchell, Machine Learning, The McGraw-Hill Companies, Inc., 1997
  6. Serge Belongie, Chad Carson, Hayit greenspan, and Iitendra Malik, 'Color and Texture-Based Image Segmentation Using EM and Its Application to Content-Based linage Retrieval,' Sixth Intenational Conference on Computer Vison, pp.675-682, January. 1998 https://doi.org/10.1109/ICCV.1998.710790
  7. Abhijit. S.Pandy, Pattern Recognition With Neural Networks in C++, IEEE Press, 1995
  8. Ioannis Pitas, Digital Image Processing Algorithms, Prentice Hall, England Cliffs, NJ, 1993
  9. Rafael C. Gonzalez, Richard E. Woods, Digital Image Processing, Addison- Wesley Publishing Company, 1993
  10. P.Aigrain, H. Zang and D. Petkovic, 'Content-Based Representation and Retrieval of Visual Media : A State-of-the-Art Review.' Multimedia Tools and Applications. vol.3, pp.179-202, 1996 https://doi.org/10.1007/BF00393937
  11. Robert M. Haralick, K Shanmugam, Its'hak Dinstein, 'Textural Features for Image Classification' IEEE Transactions on Systems, Man, and Cybernetics Vol. SMC-3, No.6 November 1973
  12. W. Y. Ma and B. S. Manjunath, 'A Pattern Thesaurus for Browsing Large Aerial Photographs,' Tech, Rep. ECE TR-96-10, June 1996
  13. P. P. Ohanian and Richard C. Dubes, 'Performance Evaluation For Four Classes of Textural Features,' Pattern Recognition, vol.25, no.8, pp. 819-833, 1992 https://doi.org/10.1016/0031-3203(92)90036-I
  14. Th. Hermes, Ch. Klauck, J. Krey B, and J. Zhang, 'Image Retrieval for Informatin Systems,' In Proc, SPIE Vol 2420: Storage and Retrieval for Image and Video Databases, pp.394-405, February 1995 https://doi.org/10.1117/12.205310