Fig. 2. Process of YOLO network model
Fig. 3. YOLO Network Structure
Fig. 4. Concept for multi-layer overlapped windows
Fig. 5. Sample CCTV image captured on the road
Fig. 6. Detection Result in Small Dense Vehicles case
Fig. 7. Example of assign and keep ID each vehicle on entrance and exit image
Fig. 8. Sub-Window and Intersection Area
Fig. 9. Setup method for identical vehicle decision
Fig. 10. Vehicle Identity Decision inside Intersection Area
Fig. 11. Setup Multi-Windows in Experiment
Fig. 12. Detection result for small dense vehicle
Fig. 13. Voting detection result according to distance
Fig. 14. Comparison for vehicle tracking result using own ID from entrance to way out
Fig. 15. Relationship accurate and FPS for changing filters
Fig. 1. (a) Comparison mAP and time of YOLO v2 with other algorithms [12] (b) YOLO v3 [17]]
Table. 1. Vehicle Detection Result
Table. 2. Vehicle Tracking Result
Table. 3. Comparison accurate and time according to change filters
References
- S. Russell and P. Norvig, "Artificial Intelligence : A Modern Approach," NJ, USA: Prentice Hall Press, 2009.
- O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, "ImageNet large scale visual recognition challenge," Int. J. Comput. Vis., vol. 115, no. 3, pp. 211-252, Apr. 2015. https://doi.org/10.1007/s11263-015-0816-y
- S. Ioffe and C. Szegedy, "Batch normalization : Accelerating deep network training by reducing internal covariate shift," In Proc. ICML, 2015.
- T. Young, D. Hazarika, S. Poria, and E. Cambria, "Recent Trends in Deep Learning Based Natural Language Processing," arXiv preprint arXiv : 1708.02709. 2017.
- K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask R-CNN," ICCV, pp. 2980-2988. IEEE, 2017.
- O. Ronneberger, P. Fischer, and T. Brox, "Unet: Convolutional Networks for Biomedical Image Segmentation," International Conference on Medical image computing and computer-assisted intervention, pp. 234-241, 2015.
- V. Badrinarayanan, A. Kendall, and R. Cipolla, "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation," arXiv preprint arXiv: 1511.00561, 2015.
- Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradientbased learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998. https://doi.org/10.1109/5.726791
- K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," In Proc. ICLR, 2015.
- A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in neural information processing systems, 2012, pp. 1097-1105.
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," In Proc. CVPR, 2015.
- R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 580-587, 2014.
- R. Girshick, "Fast r-cnn," Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1440-1448.
- W. Liu et al., "SSD: Single shot multibox detector," In Proc. ECCV, pp. 21-37, 2016.
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once : Unified, real-time object detection," In Proc. CVPR, 2016.
- J. Redmon and A. Farhadi, "YOLO9000 : Better, Faster, Stronger", Proceedings of the IEEE conference on computer vision and pattern recognition, pp.779-788, 2016.
- J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement", arXiv preprint arXiv : 1804.02767, 2018.