DOI QR코드

DOI QR Code

Deep Learning-based Image Data Processing and Archival System for Object Detection of Endangered Species

  • Choe, Dea-Gyu (Department of Computer Science, Sangmyung University) ;
  • Kim, Dong-Keun (Department of Human-Centered Artificial Intelligence, Sangmyung University)
  • Received : 2020.10.31
  • Accepted : 2020.12.22
  • Published : 2020.12.31

Abstract

It is important to understand the exact habitat distribution of endangered species because of their decreasing numbers. In this study, we build a system with a deep learning module that collects the image data of endangered animals, processes the data, and saves the data automatically. The system provides a more efficient way than human effort for classifying images and addresses two problems faced in previous studies. First, specious answers were suggested in those studies because the probability distributions of answer candidates were calculated even if the actual answer did not exist within the group. Second, when there were more than two entities in an image, only a single entity was focused on. We applied an object detection algorithm (YOLO) to resolve these problems. Our system has an average precision of 86.79%, a mean recall rate of 93.23%, and a processing speed of 13 frames per second.

Keywords

References

  1. H. Nguyen, S. J. Maclagan, T. D. Nguyen, T. Nguyen, P. Flemons, K. Andrews, E. G. Ritchie, and D. Phung, "Animal Recognition and Identification with Deep Convolutional Neural Networks for Automated Wildlife Monitoring," in Proceedings of the 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA), Tokyo, Japan, pp. 40-49, 2017. DOI: 10.1109/DSAA.2017.31.
  2. P. Zhuang, L. Xing, Y. Liu, S. Guo, and Y. Qiao, "Marine Animal Detection and Recognition with Advanced Deep Learning Models," 2017 CLEF Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum, vol. 1866, 2017.
  3. M. S. Norouzzadeh, A. Nguyen, M. Kosmala, A. Swanson, M. S. Palmer, C. Packer, and J. Clune, "Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning," Proceedings of theNational Academy of Sciences, vol. 115, no. 25, pp. E5716-E5725, 2018. DOI: 10.1073/pnas.1719367115.
  4. S. F. Pires, "The illegal parrot trade: a literature review," Global Crime, vol. 13, no. 3, pp. 176-190, 2012. DOI: 10.1080/17440572.2012.700180.
  5. S. H. Kim and B. H. Yu, "Automatic Identification of Wild Animals using Deep Learning," Proceedings of the Korean Society of Environment and Ecology Conference Korean Society of Environment and Ecology Annual, vol. 2018, no. 1, pp. 34-35, 2018.
  6. S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, pp. 1137-1149, 2016. DOI: 10.1109/TPAMI.2016.2577031.
  7. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, pp. 779-788, 2016. DOI: 10.1109/CVPR.2016.91.
  8. Opencv, Opencv Tutorials [Internet], Available: https://docs.opencv.org/master/d9/df8/tutorial_root.html.
  9. Flask, Flask Documentations [Internet], Available: https://flask.palletsprojects.com/en/1.1.x/.
  10. Tensorflow, Tensorflow Serving [Internet], Available: https://tensorflowkorea.gitbooks.io/tensorflow-kr/content/g3doc/tutorials/tfserve/.
  11. ORACLE, MySQL [Internet], Available: https://www.mysql.com/.
  12. D. G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," International Journal of Computer Vision, vol. 60, pp. 91-110, 2004. DOI: 10.1023/B:VISI.0000029664.99615.94.
  13. N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," in 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), San Diego: CA, pp. 886-893, 2005. DOI: 10.1109/CVPR.2005.177.
  14. Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, "Backpropagation Applied to Handwritten Zip Code Recognition," in Neural Computation, vol. 1, no. 4, pp. 541-551, Dec. 1989. DOI: 10.1162/neco.1989.1.4.541.
  15. K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas: NV, pp. 770-778, 2016. DOI: 10.1109/CVPR.2016.90.
  16. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, "SSD: Single Shot MultiBox Detector," ECCV 2016: Computer Vision, pp. 21-37, 2016. DOI: 10.1007/978-3-319-46448-0_2.
  17. K. He, X. Zhang, S. Ren, and J. Sun, "Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 9, pp. 1904-1916, 2014. DOI: 10.1109/TPAMI.2015.2389824.
  18. J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu: HI, pp. 6517-6525, 2017. DOI: 10.1109/CVPR.2017.690.
  19. J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," 2018, [Online] Available: https://arxiv.org/abs/1804.02767.
  20. F. Chen, N. Chen, H. Mao, and H. Hu, "Assessing four Neural Networks on Handwritten Digit Recognition Dataset (MNIST)," 2018, [Online] Available: https://arxiv.org/abs/1811.08278.
  21. A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Neural Information Processing Systems. vol. 60, no. 6, 2017 DOI: 10.1145/3065386.
  22. J. Deng, W. Dong, R. Socher, L. Li, K. Li, and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," in 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami: FL, pp. 248-255, 2009. DOI: 10.1109/CVPR.2009.5206848.
  23. N. Sun, X. Mo, T. Wei, D. Zhang, and W. Luo, "The Effectiveness of Noise in Data Augmentation for Fine-Grained Image Classification," in ACPR 2019: Pattern Recognition, Cham, Switzerland : Springer, pp. 779-792, 2020. DOI: 10.1007/978-3-030-41404-7_55.
  24. R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," in 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus: OH, pp. 580-587, 2014. DOI: 10.1109/CVPR.2014.81.
  25. J. K. Min and J. S. Moon, "A Substitute Model Learning Method Using Data Augmentation with a Decay Factor and Adversarial Data Generation Using Substitute Model," Journal of The Korea Institute of Information Security & Cryptology, vol. 29, no. 6, pp. 1383-1392, 2019. DOI: 10.13089/JKIISC.2019.29.6.1383.
  26. H. J. Shin, S. I. Lee, H. W. Jeoung, and J. W. Park, "Indoor Plants Image Classification Using Deep Learning and Web Application for Providing Information of Plants," Journal of Knowledge Information Technology and Systems (JKITS), vol. 15, no. 2, pp. 167-175, 2020. DOI: 10.34163/jkits.2020.15.2.002.
  27. S. W. Jung, I. S. Kim, I. K. Kim, and K. S. Lim, "A Study on Data Imbalance Problem Using GAN (Generative Adversarial Network)," Proceedings of Symposium of the Korean Institute of Communications and Information Sciences, pp. 1390-1391, Jan. 2019
  28. A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," 2017 [Online] Available: https://arxiv.org/abs/1704.04861.
  29. T. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature Pyramid Networks for Object Detection," in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu: HI, pp. 936-944, 2017. DOI: 10.1109/CVPR.2017.106.
  30. Z. Huang, Z. Pan, and B. Lei, "Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data," Remote Sensing, vol. 9, no. 9, pp. 907, 2017. DOI: 10.3390/rs9090907.
  31. H. S. Lee, J. G. Kim, J. W. Yu, Y. S. Jeong, and S. S. Kim, "Multi-class Classification using Transfer Learning based Convolutional Neural Network," Journal of Korean Institute of Intelligent Systems, vol. 28, no. 6, pp. 531-537, 2018. DOI: 10.5391/JKIIS.2018.28.6.531.
  32. S. J. Park, J. H. Yoon, and C. B. Ahn, "Compressed-Sensing Cardiac CINE MRI using Neural Network with Transfer Learning," Journal of IKEEE, vol. 23, no. 4, pp. 293-299, 2019. DOI: 10.7471/ikeee.2019.23.4.1408.
  33. D. G. Cheo, E. J. Choi, and D. K. Kim, "The Real-Time Mobile Application for Classifying of Endangered Parrot Species Using the CNN Models Based on Transfer Learning," Mobile Information. Systems, vol. 2020, pp. 13, 2020. DOI: 10.1155/2020/1475164.