DOI QR코드

DOI QR Code

Image Segmentation by Cascaded Superpixel Merging with Privileged Information

단계적 슈퍼픽셀 병합을 통한 이미지 분할 방법에서 특권정보의 활용 방안

  • Park, Yongjin (Department of Computer Science and Engineering, POSTECH)
  • Received : 2019.07.19
  • Accepted : 2019.08.06
  • Published : 2019.09.30

Abstract

We propose a learning-based image segmentation algorithm. Starting from super-pixels, our method learns the probability of merging two regions based on the ground truth made by humans. The learned information is used in determining whether the two regions should be merged or not in a segmentation stage. Unlike exiting learning-based algorithms, we use both local and object information. The local information represents features computed from super-pixels and the object information represent high level information available only in the learning process. The object information is considered as privileged information, and we can use a framework that utilize the privileged information such as SVM+. In experiments on the Berkeley Segmentation Dataset and Benchmark (BSDS 500) and PASCAL Visual Object Classes Challenge (VOC 2012) data set, out model exhibited the best performance with a relatively small training data set and also showed competitive results with a sufficiently large training data set.

기존의 영역 병합을 통한 이미지 분할 방법에서는 이웃한 두 영역 사이의 정보만을 이용하여 병합 모델을 학습한다. 학습 과정에서는 두 영역 사이의 지역적인 정보뿐만 아니라 물체 정보와 같은 전역적인 정보 또한 활용 가능하므로 주어진 모든 정보를 활용하여 병합 모델의 성능을 높이는 것이 바람직하다. 본 논문에서는 학습 기반의 이미지 분할 알고리즘에서 학습 시에만 사용 가능한 특권정보를 활용하는 SVM+ 방법을 제안한다. 특권정보는 학습 시에만 사용 가능한 정보이므로 전통적인 지도학습 방법으로는 학습이 불가하다. SVM+와 같은 특권정보를 학습할 수 있는 구조를 통해 지역 정보뿐만 아니라 물체 정보를 포함하여 영역 간의 병합 여부를 결정하는 모델을 학습하였다. BSDS 500 데이터 세트와 VOC 2012 데이터 세트에서 벤치마크를 수행하였으며 대부분의 평가 지표에서 개선된 성능을 보여 주었다. 특히 학습 데이터 세트가 작은 경우에 기존의 알고리즘에 비해서 월등히 뛰어난 성능을 보인다.

Keywords

References

  1. D. Weiss, and B. Taskar, "Scalpel: Segmentation cascades with localized priors and efficient learning," in Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pp. 2035-2042, IEEE, 2013.
  2. Z. Ren, and G. Shakhnarovich, "Image segmentation by cascaded region agglomeration," in Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pp. 2011-2018, IEEE, 2013.
  3. R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, and S. Susstrunk, "Slic superpixels compared to state-of-the-art superpixel methods," Pattern Analysis and Machine Intelligence, IEEE Transaction on, 34(11):2274-2282, 2012. https://doi.org/10.1109/TPAMI.2012.120
  4. V. Vapnik, and A. Vashist, "A new learning paradigm: Learning using privileged information," Neural Networks, 22(5):544-557, 2009. https://doi.org/10.1016/j.neunet.2009.06.042
  5. D. Martin, C. Fowlkes, D. Tal, and J. Malik, "The Berkeley Segmentation Dataset and Benchmark," Available: https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/.
  6. M. Everingham, L. V. Gool, C. Williams, J. Winn, and A. Zisserman, "The PASCAL VOC Project," Available: http://host.robots.ox.ac.uk/pascal/VOC/.
  7. N. Senthilkumaran, and S. Vaithegi, "Image segmentation by using thresholding techniques for medical images," Computer Science & Engineering: An International Journal 6.1 (2016): 1-13.
  8. N.Dhanachandra, K. Manglem, and Y. J. Chanu, "Image segmentation using K-means clustering algorithm and subtractive clustering algorithm," Procedia Computer Science 54 (2015): 764-771. https://doi.org/10.1016/j.procs.2015.06.090
  9. K. Ramgopal, and P. Gautam, "Fast medical image segmentation using energy-based method," Pattern and Data Analysis in Healthcare Settings. IGI Global, 2017. 35-60.
  10. A. Pratondo, C. K. Chui, and S. H. Ong, "Robust edge-stop functions for edge-based active contour models in medical image segmentation," IEEE Signal Processing Letters 23.2 (2015): 222-226. https://doi.org/10.1109/LSP.2015.2508039
  11. Z. Liu, X. Li, P. Luo, C. C. Loy, and X. Tang, "Semantic image segmentation via deep parsing network," In Proceedings of the IEEE international conference on computer vision, pp. 1377-1385, 2015.
  12. W. Liu, A. Rabinovich, and A. C. Berg, "Parsenet: Looking wider to see better," arXiv preprint arXiv:1506.04579, 2015.
  13. H. Noh, S. Hong, and B. Han, "Learning deconvolution network for semantic segmentation," In Proceedings of the IEEE international conference on computer vision, pages 1520-1528, 2015.
  14. K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask rcnn," In Proceedings of the IEEE international conference on computer vision, pp. 2961-2969, 2017.
  15. H. Zhang, K. Dana, J. Shi, Z. Zhang, X. Wang, A. Tyagi, and A. Agrawal, "Context encoding for semantic segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7151-7160, 2018.
  16. C. H. Wu, C. C. Lai, H. J. Lo, and P. S. Wang, "A Comparative Study on Encoding Methods of Local Binary Patterns for Image Segmentation," International Conference on Smart Vehicular Technology, Transportation, Communication and Applications. Springer, Cham, 2018.
  17. P. Arbelaez, M. Maire, C Fowlkes, and J. Malik, "Contour detection and hierarchical image segmentation," Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(5):898-916, 2011. https://doi.org/10.1109/TPAMI.2010.161
  18. X. Ren, and J. Malik, "Learning a classication model for segmentation," In Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on, pp. 10-17. IEEE, 2003.
  19. L. Gao, J. Song, F. Nie, F. Zou, N. Sebe, and H. T. Shen, "Graph-without-cut: An ideal graph learning for image segmentation," Thirtieth AAAI Conference on Artificial Intelligence. 2016.
  20. P. F. Felzenszwalb, and D. P. Huttenlocher, "Ecient graph-based image segmentation," International Journal of Computer Vision, 59(2):167-181, 2004. https://doi.org/10.1023/B:VISI.0000022288.19776.77
  21. D. Ming, J. Li, J. Wang, and M. Zhang, "Scale parameter selection by spatial statistics for GeOBIA: Using mean-shift based multi-scale segmentation as an example," ISPRS Journal of Photogrammetry and Remote Sensing 106 (2015): 28-41. https://doi.org/10.1016/j.isprsjprs.2015.04.010
  22. M. V. Bergh, X. Boix, G. Roig, B. de Capitani, and L. Van Gool, "Seeds: Superpixels extracted via energy-driven sampling," Computer Vision-ECCV 2012, pp. 13-26, Spr. 2012.'