DOI QR코드

DOI QR Code

U-net and Residual-based Cycle-GAN for Improving Object Transfiguration Performance

물체 변형 성능을 향상하기 위한 U-net 및 Residual 기반의 Cycle-GAN

  • Received : 2018.01.16
  • Accepted : 2018.02.20
  • Published : 2018.02.28

Abstract

The image-to-image translation is one of the deep learning applications using image data. In this paper, we aim at improving the performance of object transfiguration which transforms a specific object in an image into another specific object. For object transfiguration, it is required to transform only the target object and maintain background images. In the existing results, however, it is observed that other parts in the image are also transformed. In this paper, we have focused on the structure of artificial neural networks that are frequently used in the existing methods and have improved the performance by adding constraints to the exiting structure. We also propose the advanced structure that combines the existing structures to maintain their advantages and complement their drawbacks. The effectiveness of the proposed methods are shown in experimental results.

Keywords

References

  1. P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros, "Image-to-image translation with conditional adversarial networks," 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp. 5967-5976, 2017.
  2. R. Zhang, P. Isola, and A. A. Efros, "Colorful image colorization," European Conference on Computer Vision,, Amsterdam, Netherlands, pp. 649-666, 2016.
  3. Preferred Networks, PaintsChainer, [Online], https://paintschainer.preferred.tech, Accessed: January 16, 2018.
  4. I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde- Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial nets," in 28th Annual Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2672-2680, 2014.
  5. J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks," 2017 IEEE International Conference on Computer Vision, Venice, Italy, pp. 2242-2251, 2017.
  6. O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, "ImageNet Large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, no. 3, pp. 211-252, April, 2015. https://doi.org/10.1007/s11263-015-0816-y
  7. T. Kim, M. Cha, H. Kim, J. K. Lee, and J. Kim, "Learning to discover cross-domain relations with generative adversarial networks," International Conference on Machine Learning, Sydney, Australia, pp. 1857-1865, 2017.
  8. Z. Yi, H. Zhang, P. Tan, and M. Gong, "Dualgan: Unsupervised dual learning for image-to-image translation," in IEEE International Conference on Computer Vision, Venice, Italy, pp. 2868-2876, 2017.
  9. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," 2017 IEEE International Conference on Computer Vision, Las Vegas, USA, pp. 770-778, 2016.
  10. K. He, X. Zhang, S. Ren, and J. Sun, "Identity mappings in deep residual networks," European Conference on Computer Vision, Amsterdam, Netherlands, pp. 630-645, 2016.
  11. J. Johnson, A. Alahi, and L. Fei-Fei, "Perceptual losses for real-time style transfer and super-resolution," European Conference on Computer Vision, Amsterdam, Netherlands, pp. 694-711, 2016.
  12. O. Ronneberger, P. Fischer, and T. Brox, "U-net: Convolutional networks for biomedical image segmentation," International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, pp.234-241, 2015.
  13. X. Mao, Q. Li, H. Xie, R. Y.K. Lau, Z. Wang, and S. P. Smolley, "Least squares generative adversarial networks," 2017 IEEE International Conference on Computer Vision, Venice, Italy, pp. 2813-2821, 2017.
  14. A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang, and R. Webb, "Learning from simulated and unsupervised images through adversarial training," 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp. 2242-2251, 2017.
  15. E. Borenstein and S. Ullman, "Combined top-down/bottom-up segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 12, pp. 2109-2125, December, 2008. https://doi.org/10.1109/TPAMI.2007.70840
  16. W. Mokrzycki and M. Tatol, "Color difference ΔE - A survey," Machine Graphic and Vision, vol. 20, no. 4, pp. 383-411, April, 2011.