DOI QR코드

DOI QR Code

Contact Detection based on Relative Distance Prediction using Deep Learning-based Object Detection

딥러닝 기반의 객체 검출을 이용한 상대적 거리 예측 및 접촉 감지

  • Hong, Seok-Mi (Department of Liberal Arts, Sangji University) ;
  • Sun, Kyunghee (Contents Convergence Software Research Institute, Kyonggi University) ;
  • Yoo, Hyun (Contents Convergence Software Research Institute, Kyonggi University)
  • 홍석미 (상지대학교 교양대학) ;
  • 선경희 (경기대학교 콘텐츠융합소프트웨어연구소) ;
  • 유현 (경기대학교 콘텐츠융합소프트웨어연구소)
  • Received : 2021.11.01
  • Accepted : 2022.01.20
  • Published : 2022.01.28

Abstract

The purpose of this study is to extract the type, location, and absolute size of an object in an image using a deep learning algorithm, predict the relative distance between objects, and use this to detect contact between objects. To analyze the size ratio of objects, YOLO, a CNN-based object detection algorithm, is used. Through the YOLO algorithm, the absolute size and position of an object are extracted in the form of coordinates. The extraction result extracts the ratio between the size in the image and the actual size from the standard object-size list having the same object name and size stored in advance, and predicts the relative distance between the camera and the object in the image. Based on the predicted value, it detects whether the objects are in contact.

본 연구의 목적은 딥러닝 알고리즘을 이용하여 영상 내 객체의 종류, 위치, 절대 크기를 추출하고, 객체간 상대적 거리를 예측하며, 이를 이용하여 객체 간의 접촉을 감지하기 위한 내용이다. 객체의 크기 비율을 분석하기 위하여, CNN 기반의 Object Detection 알고리즘인 YOLO를 이용한다. YOLO 알고리즘을 통하여 2D 형태의 이미지에서 각 개체의 절대적인 크기와 위치를 좌표의 형태로 추출한다. 추출 결과는 사전에 저장된 동일한 객체의 명칭과 크기를 가지는 표준 객체-크기 리스트로부터 영상 내 크기와 실제 크기 간의 비례를 추출하며, 영상 내 카메라-객체 간의 상대적인 거리를 예측한다. 예측된 값을 바탕으로 영상에서 객체 간 접촉 여부를 감지한다.

Keywords

Acknowledgement

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (Grant No.: NRF-2020R1A6A1A03040583).

References

  1. R. Mu & X. Zeng. (2019). A review of deep learning research. KSII Trans. Internet Inf. Syst, 13(4), 1738-1764. DOI : 10.3837/tiis.2019.04.001
  2. K. He, X. Zhang, S. Ren & J. Sun. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
  3. J. Long, E. Shelhamer & T. Darrell. (2015). Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3431-3440).
  4. R. Huang, J. Pedoeem & C. Chen. (2018). YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers. In 2018 IEEE International Conference on Big Data (Big Data) (pp. 2503-2510). IEEE.
  5. M. Tan & Q. Le. (2019). Efficientnet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning (pp. 6105-6114). PMLR.
  6. H. Nguyen, L. M. Kieu, T. Wen & C. Cai. (2018). Deep learning methods in transportation domain: a review. IET Intelligent Transport Systems, 12(9), 998-1004. https://doi.org/10.1049/iet-its.2018.0064
  7. P. W. Khan, Y. C. Byun & N. Park. (2020). A data verification system for CCTV surveillance cameras using blockchain technology in smart cities. Electronics, 9(3), 484. DOI : 10.3390/electronics9030484
  8. M. J. Shafiee, B. Chywl, F. Li & A. Wong. (2017). Fast YOLO: A fast you only look once system for real-time embedded object detection in video. arXiv Prepr. arXiv1709.05943.
  9. L. Liu et al. (2020). Deep Learning for Generic Object Detection: A Survey. International journal of computer vision, 128(2), 261-318. https://doi.org/10.1007/s11263-019-01247-4
  10. Y. C. Ahn & Y. H. Lee. (2019). A Research of CNN based Object Detection for Multiple Object Tracking in Image. Journal of the Semiconductor & Display Technology, 18(3), 110-114.
  11. P. Wang et al. (2018). Understanding convolution for semantic segmentation. In 2018 IEEE winter conference on applications of computer vision (WACV), 1451-1460.
  12. M. Yang, K. Yu, C. Zhang, Z. Li & K. Yang. (2018). Denseaspp for semantic segmentation in street scenes. in Proceedings of the IEEE conference on computer vision and pattern recognition, 3684-3692.
  13. A. Garcia-Garcia, S. Orts-Escolano, S. Oprea, V. Villena-Martinez & J. Garcia-Rodriguez. (2017). A review on deep learning techniques applied to semantic segmentation. arXiv Prepr. arXiv1704.06857.
  14. T. Y. Lin et al. (2014). Microsoft coco: Common objects in context. In European conference on computer vision, 740-755.
  15. National Information Society Agency. (2021). AI Hub. National Information Society Agency(Online). https://aihub.or.kr