Object detection technology trend and development direction using deep learning

Kwak, NaeJoung;Kim, DongJu;

doi:10.17703/IJACT.2020.8.4.119

International Journal of Advanced Culture Technology

Volume 8 Issue 4
/
Pages.119-128
/
2020
/
2288-7202(pISSN)
/
2288-7318(eISSN)

The International Promotion Agency of Culture Technology (국제문화기술진흥원)

DOI QR Code

Object detection technology trend and development direction using deep learning

Kwak, NaeJoung (Dept. of Cyber and Security, Baejae Univ.) ;
Kim, DongJu (Postech Institute of Artificial Intelligence, POSTECH)

Received : 2020.11.17
Accepted : 2020.12.13
Published : 2020.12.31

https://doi.org/10.17703/IJACT.2020.8.4.119 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Object detection is an important field of computer vision and is applied to applications such as security, autonomous driving, and face recognition. Recently, as the application of artificial intelligence technology including deep learning has been applied in various fields, it has become a more powerful tool that can learn meaningful high-level, deeper features, solving difficult problems that have not been solved. Therefore, deep learning techniques are also being studied in the field of object detection, and algorithms with excellent performance are being introduced. In this paper, a deep learning-based object detection algorithm used to detect multiple objects in an image is investigated, and future development directions are presented.

Keywords

References

A.Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Proc. NIPS, 2012.
S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, pp. 1137-1149, June 2017. https://doi.org/10.1109/TPAMI.2016.2577031
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, "Ssd: Single shot multibox detector," in Computer Vision , pp. 21-37, 2016.
Uijlings, J. RR et al.,"Selective search for object recognition," Int. journal of computer vision, pp.154-171., 2013
R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, June 2014.
R. Girshick, "Fast r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 1440-1448, Dec 2015.
K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
K. He, G. Gkioxari, P. Dollr, and R. Girshick, "Mask r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 2980-2988, Oct 2017.
T. Lin, P. Dollr, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936-944, July 2017.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
J. Redmon and A. Farhadi, "Yolo9000: Better, faster, stronger," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517-6525, July 2017.
S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," arXiv preprint arXiv:1502.03167, 2015.
J. Redmon et al, "YOLOv3: An Incremental Improvement," arXiv 1804.02767
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, June 2016.
A. Bochkovskiy, W. Chien-Yao, M. L. Hong-Yuan, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv:2004.10934v1, 2020.
W. Chien-Yao Wang, M. L. Hong-Yuan, W. Yueh-Hua, C. Ping-Yang, H. Jun-Wei, and Y. I-Hau, "CSPNet:A new backbone that can enhance learning capability of cnn," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop, 2020.
S. Liu, L. Qi, Q. Haifang, S. Jianping, and J. Jiaya, "Path aggregation network for instance segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8759-8768, 2018.
S.Zhang, L.Wen, X.Bian, Z.Lei, and S.Z.Li, "Single-shot refinement neural network for object detection," in Proceedings of the IEEE Conference on Computer Vision, 2018
H. Hu, J. Gu, Z. Zhang, J. Dai, and Y. Wei, "Relation networks for object detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3588-3597, 2018.
X. Zhou, D. Wang, P. Krahenb, "Objects as Points," arXiv:1904.07850, 2019.
M. Tan, R. Pang, Q. V. Le, "EfficientDet: Scalable and Efficient Object Detection," arXiv preprint arXiv:1911.09070v4, 2020.
M. Tan, Q.V. Le, "EfficientNet : Rethinking Model Scaling for Convolutional Neural Networks, arXiv:1905.11946v5, 2019.
H. Bilen, A. Vedaldi, "Universal representations : The missing link between faces, text, planktons, and cat breeds," arXiv:1701.07275 , 2017.
X. Wang, Z. Cai, D. Gao, and N. Vasconcelos, "Towards universal object detection by domain attention," arXiv:1904.04402, 2019.

International Journal of Advanced Culture Technology

Object detection technology trend and development direction using deep learning

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)