U-net and Residual-based Cycle-GAN for Improving Object Transfiguration Performance

Kim, Sewoon;Park, Kwang-Hyun;

doi:10.7746/jkros.2018.13.1.001

The Journal of Korea Robotics Society (로봇학회논문지)

Volume 13 Issue 1
/
Pages.1-7
/
2018
/
1975-6291(pISSN)
/
2287-3961(eISSN)

Korea Robotics Society (한국로봇학회)

DOI QR Code

U-net and Residual-based Cycle-GAN for Improving Object Transfiguration Performance

물체 변형 성능을 향상하기 위한 U-net 및 Residual 기반의 Cycle-GAN

Kim, Sewoon (School of Robotics, Kwangwoon University) ;
Park, Kwang-Hyun (School of Robotics, Kwangwoon University)

김세운 ;
박광현

Received : 2018.01.16
Accepted : 2018.02.20
Published : 2018.02.28

https://doi.org/10.7746/jkros.2018.13.1.001 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

The image-to-image translation is one of the deep learning applications using image data. In this paper, we aim at improving the performance of object transfiguration which transforms a specific object in an image into another specific object. For object transfiguration, it is required to transform only the target object and maintain background images. In the existing results, however, it is observed that other parts in the image are also transformed. In this paper, we have focused on the structure of artificial neural networks that are frequently used in the existing methods and have improved the performance by adding constraints to the exiting structure. We also propose the advanced structure that combines the existing structures to maintain their advantages and complement their drawbacks. The effectiveness of the proposed methods are shown in experimental results.

Keywords

References

P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros, "Image-to-image translation with conditional adversarial networks," 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp. 5967-5976, 2017.
R. Zhang, P. Isola, and A. A. Efros, "Colorful image colorization," European Conference on Computer Vision,, Amsterdam, Netherlands, pp. 649-666, 2016.
Preferred Networks, PaintsChainer, [Online], https://paintschainer.preferred.tech, Accessed: January 16, 2018.
I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde- Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial nets," in 28th Annual Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2672-2680, 2014.
J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks," 2017 IEEE International Conference on Computer Vision, Venice, Italy, pp. 2242-2251, 2017.
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, "ImageNet Large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, no. 3, pp. 211-252, April, 2015. https://doi.org/10.1007/s11263-015-0816-y
T. Kim, M. Cha, H. Kim, J. K. Lee, and J. Kim, "Learning to discover cross-domain relations with generative adversarial networks," International Conference on Machine Learning, Sydney, Australia, pp. 1857-1865, 2017.
Z. Yi, H. Zhang, P. Tan, and M. Gong, "Dualgan: Unsupervised dual learning for image-to-image translation," in IEEE International Conference on Computer Vision, Venice, Italy, pp. 2868-2876, 2017.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," 2017 IEEE International Conference on Computer Vision, Las Vegas, USA, pp. 770-778, 2016.
K. He, X. Zhang, S. Ren, and J. Sun, "Identity mappings in deep residual networks," European Conference on Computer Vision, Amsterdam, Netherlands, pp. 630-645, 2016.
J. Johnson, A. Alahi, and L. Fei-Fei, "Perceptual losses for real-time style transfer and super-resolution," European Conference on Computer Vision, Amsterdam, Netherlands, pp. 694-711, 2016.
O. Ronneberger, P. Fischer, and T. Brox, "U-net: Convolutional networks for biomedical image segmentation," International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, pp.234-241, 2015.
X. Mao, Q. Li, H. Xie, R. Y.K. Lau, Z. Wang, and S. P. Smolley, "Least squares generative adversarial networks," 2017 IEEE International Conference on Computer Vision, Venice, Italy, pp. 2813-2821, 2017.
A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang, and R. Webb, "Learning from simulated and unsupervised images through adversarial training," 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp. 2242-2251, 2017.
E. Borenstein and S. Ullman, "Combined top-down/bottom-up segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 12, pp. 2109-2125, December, 2008. https://doi.org/10.1109/TPAMI.2007.70840
W. Mokrzycki and M. Tatol, "Color difference ΔE - A survey," Machine Graphic and Vision, vol. 20, no. 4, pp. 383-411, April, 2011.

The Journal of Korea Robotics Society (로봇학회논문지)

U-net and Residual-based Cycle-GAN for Improving Object Transfiguration Performance

물체 변형 성능을 향상하기 위한 U-net 및 Residual 기반의 Cycle-GAN

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)