Advanced SearchSearch Tips
Automatic Segmentation of Product Bottle Label Based on GrabCut Algorithm
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Automatic Segmentation of Product Bottle Label Based on GrabCut Algorithm
Na, In Seop; Chen, Yan Juan; Kim, Soo Hyung;
  PDF(new window)
In this paper, we propose a method to build an accurate initial trimap for the GrabCut algorithm without the need for human interaction. First, we identify a rough candidate for the label region of a bottle by applying a saliency map to find a salient area from the image. Then, the Hough Transformation method is used to detect the left and right borders of the label region, and the k-means algorithm is used to localize the upper and lower borders of the label of the bottle. These four borders are used to build an initial trimap for the GrabCut method. Finally, GrabCut segments accurate regions for the label. The experimental results for 130 wine bottle images demonstrated that the saliency map extracted a rough label region with an accuracy of 97.69% while also removing the complex background. The Hough transform and projection method accurately drew the outline of the label from the saliency area, and then the outline was used to build an initial trimap for GrabCut. Finally, the GrabCut algorithm successfully segmented the bottle label with an average accuracy of 92.31%. Therefore, we believe that our method is suitable for product label recognition systems that automatically segment product labels. Although our method achieved encouraging results, it has some limitations in that unreliable results are produced under conditions with varying illumination and reflections. Therefore, we are in the process of developing preprocessing algorithms to improve the proposed method to take into account variations in illumination and reflections.
GrabCut;Segmentation;Bottle label;Initial trimap;Human interaction;
 Cited by
S. W. Hong and L. Choi, "Automatic Flowers Recognition Using Segmentation," Korea Computer Congress, vol. 38, no. 1(A), 2011, pp. 463-465.

J. S. Lee, S. H. Kim, and J. H Park, G. S Lee, H. J Yang, C. W. Lee, "Recognition of Text in Wine Label Images," IEEE Pattern Recognition on Chinese Conference, 2009, pp. 1-5.

N. Otsu, "A Threshold Selection Method from Gray-Level Histograms," IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, 1979, pp. 62-66. crossref(new window)

J. B. MacQueen, Some Methods for Classification and Analysis of Multiplicate Observations, Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, University of California Press, 1967, pp. 281-297.

D. Comaniciu and P. Meer, "Mean Shift: A Robust Approach Toward Feature Space Analysis," IEEE Transactions on PAMI, vol. 24, no. 5, 2002, pp. 1-17. crossref(new window)

Jong Hyun Park, Guee Sang Lee, and Soon Yong Park, "Color image segmentation using adaptive mean shift and statistical model-based method," Computers and mathematics with Applications, vol. 57, issue 6, Mar. 2009, pp. 970-980. crossref(new window)

T. F. Chan and L. A. Vese, "Active Contours Without Edge," IEEE Transactions on Image Processing, vol. 10, no. 2, 2001, pp. 266-277. crossref(new window)

A. Tasi, A. Yezzi, and A. Willsky, "Curve Evolution Implementation of the Mumford-Shah Functional for Image Segmentation, Depositing, Interpolation and Margination," Institute of Electrical and Electronics Engineers Transactions on Image Processing, vol. 10, no. 8, 2001, pp. 1169-1186.

D. Cremers, "A multiphase level set framework for variational motion segmentation," In Scale Space Methods in Computer Vision, vol. 2695, 2003, pp. 599-614.

C. Li, C. Xu, and M. D. Fox, "Level set evolution without reinitialization: A new variational formulation," Institute of Electrical and Electronics Engineers Conference on Computer Vision and Pattern Recognition, vol. 1, 2005, pp. 430-436.

C. Li, C. Y. Kao, J. C. Gore, and Z. Ding, "Implicit Active Contours Driven by Local Binary Fitting Energy," IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1-7.

V. Caselles, F. Catte, T. Coll, and F. Dibis, "A geometric model for active contours in image processing," Nummerische Mathematik, no. 66, 1993, pp. 1-31.

M. Rousson and R. Seriche, "A Variational Framework for Active and Adapt active Segmentation of Vector Valued Images," Proceeding of IEEE Workshop on Motion and Video Computing, 2002.

J. B. Shi and J. D. Malik, "Normalized Cuts and Image Segmentation," IEEE Transactions on pattern analysis and machine intelligence, vol. 22, no. 8, 2000, pp. 888-905. crossref(new window)

Y. Boykov and Vladimir Kolmogorov, "An Experimental Comparison of Min-Cut/Flow Algorithms for Energy Minimization in Vision," IEEE Transactions on pattern analysis and machine intelligence, vol. 26, no. 9, 2004, pp. 1124-1137. crossref(new window)

Y. Boycov and M. Jolly, "Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images," In Proc. IEEE Int. Conf. on Computer Vision, In ICCV, 2001, pp. 105-112.

C. Rother, V. Kolmogorov, and A. Blake, "GrabCut: interactive for foreground using iterate graph cuts," SCM Transaction on Graphics(TOG), 2004.

Justin F. Talbot and Xiaoqian Xu, "Implementing GrabCut," Brigham Yong University Revised, Apr. 7, 2006.

Peng Wang, "GrabCut-Interactive Foreground Extraction,"

B. C. Ko and J. Y. Nam, "Object-of-interest image segmentation based on human attention and semantic region," Optical Society of Society of America, vol. 23, Oct. 2006, pp. 2462-2470. crossref(new window)

L. Itti, C. Koch, and E. Niebur, "A Model of Saliency-Based Visual Attention for Rapid Scene Analysis," Institute of Electrical and Electronics Engineers Transactions on Pattern Analysis and Machine Intelligence, vol. 20, 1998, pp. 1254-1259.

L. Itti, "Automatic Foveation for Video Compression Using a Neurobiological Model of Visual Attention," Institute of Electrical and Electronics Engineers Transactions on Image Processing, vol. 13, no. 10, 2004, pp. 1304-1318.

L. Elazary and L. Itti, "Interesting objects are visually salient," Journal of Vision, vol. 8, no. 33, 2008, pp. 1-15.

L. Itti and C. Koch, "Computational modeling of visual attention," Nature Reviews Neuroscience, vol. 2, no. 3, 2001, pp. 194-203. crossref(new window)

J. Harel, C. Koch, and P. Perona, Graph-based visual saliency, in Advances in Neural Information Processing System 19, Cambridge, MA: MIT Press, 2007, pp. 545-552.

D. Ballard, "Generalizing the Hough Transform to Detection Arbitrary Shape," Pattern Recognition, vol. 13, no. 2, 1981, pp. 111-122. crossref(new window)

R. Duda and P. Hart, "Use of the Hough Transformation to Detect Line and Curves in picturers," Communication of the ACM, vol. 15, no. 1, Jan. 1972, pp. 11-15. crossref(new window)

R. C. Gonzalez, E. W. and S. L. Eddins, Digital Image Processing using MATLAB, Publishing House of Electronics Industry, 2002.