DOI QR코드

DOI QR Code

Object detection technology trend and development direction using deep learning

  • Kwak, NaeJoung (Dept. of Cyber and Security, Baejae Univ.) ;
  • Kim, DongJu (Postech Institute of Artificial Intelligence, POSTECH)
  • Received : 2020.11.17
  • Accepted : 2020.12.13
  • Published : 2020.12.31

Abstract

Object detection is an important field of computer vision and is applied to applications such as security, autonomous driving, and face recognition. Recently, as the application of artificial intelligence technology including deep learning has been applied in various fields, it has become a more powerful tool that can learn meaningful high-level, deeper features, solving difficult problems that have not been solved. Therefore, deep learning techniques are also being studied in the field of object detection, and algorithms with excellent performance are being introduced. In this paper, a deep learning-based object detection algorithm used to detect multiple objects in an image is investigated, and future development directions are presented.

Keywords

References

  1. A.Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Proc. NIPS, 2012.
  2. S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, pp. 1137-1149, June 2017. https://doi.org/10.1109/TPAMI.2016.2577031
  3. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
  4. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, "Ssd: Single shot multibox detector," in Computer Vision , pp. 21-37, 2016.
  5. Uijlings, J. RR et al.,"Selective search for object recognition," Int. journal of computer vision, pp.154-171., 2013
  6. R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, June 2014.
  7. R. Girshick, "Fast r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 1440-1448, Dec 2015.
  8. K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
  9. K. He, G. Gkioxari, P. Dollr, and R. Girshick, "Mask r-cnn," in IEEE International Conference on Computer Vision (ICCV), pp. 2980-2988, Oct 2017.
  10. T. Lin, P. Dollr, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936-944, July 2017.
  11. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, June 2016.
  12. J. Redmon and A. Farhadi, "Yolo9000: Better, faster, stronger," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517-6525, July 2017.
  13. S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," arXiv preprint arXiv:1502.03167, 2015.
  14. J. Redmon et al, "YOLOv3: An Incremental Improvement," arXiv 1804.02767
  15. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, June 2016.
  16. A. Bochkovskiy, W. Chien-Yao, M. L. Hong-Yuan, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv:2004.10934v1, 2020.
  17. W. Chien-Yao Wang, M. L. Hong-Yuan, W. Yueh-Hua, C. Ping-Yang, H. Jun-Wei, and Y. I-Hau, "CSPNet:A new backbone that can enhance learning capability of cnn," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop, 2020.
  18. S. Liu, L. Qi, Q. Haifang, S. Jianping, and J. Jiaya, "Path aggregation network for instance segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8759-8768, 2018.
  19. S.Zhang, L.Wen, X.Bian, Z.Lei, and S.Z.Li, "Single-shot refinement neural network for object detection," in Proceedings of the IEEE Conference on Computer Vision, 2018
  20. H. Hu, J. Gu, Z. Zhang, J. Dai, and Y. Wei, "Relation networks for object detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3588-3597, 2018.
  21. X. Zhou, D. Wang, P. Krahenb, "Objects as Points," arXiv:1904.07850, 2019.
  22. M. Tan, R. Pang, Q. V. Le, "EfficientDet: Scalable and Efficient Object Detection," arXiv preprint arXiv:1911.09070v4, 2020.
  23. M. Tan, Q.V. Le, "EfficientNet : Rethinking Model Scaling for Convolutional Neural Networks, arXiv:1905.11946v5, 2019.
  24. H. Bilen, A. Vedaldi, "Universal representations : The missing link between faces, text, planktons, and cat breeds," arXiv:1701.07275 , 2017.
  25. X. Wang, Z. Cai, D. Gao, and N. Vasconcelos, "Towards universal object detection by domain attention," arXiv:1904.04402, 2019.