Development of an intelligent control system for sound location visualization for hearing-impaired PM users

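  • 조용현 (Department of Energy Mechanical System Engineering, Dongshin University) ;
  • 최진영 (Department of Mechanical Engineering, Dongshin University)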
  • Received : 2022.03.29
  • Accepted : 2022.06.17
  • Published : 2022.06.30

Abstract

This paper presents an intelligent control system that visualizes the direction of arrival (DOA) of sounds for hearing-impaired users of personal mobility (PM) devices. It aims to help them recognize and avoid dangerous situations signaled by sounds on the road, such as alarms and vehicle horns. Sound source localization uses a machine learning classification model whose features are time differences of arrival (TDOA) estimated with the generalized cross-correlation with phase transform (GCC-PHAT). In an experimental environment reproducing road conditions, training data were collected at wind speeds of 0, 5.8, 14.2, and 26.4 km/h, and four classification models trained on these data were compared using grid search cross-validation. The Multi-Layer Perceptron (MLP) model, which performed best, was adopted as the algorithm. Under wind, the proposed algorithm achieved an average accuracy of 90.7%, an average improvement of 7.6-11.5% over conventional sound source localization methods.
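The GCC-PHAT based TDOA feature named in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything here is an assumption for illustration: the function names `dft` and `gcc_phat_delay` are hypothetical, the signals are synthetic, and a naive DFT from the standard library stands in for the FFT routine a real system would use.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform (stdlib only; fine for short frames)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(sig, ref, max_lag):
    """Return the lag in samples by which `sig` is delayed relative to `ref`."""
    n = len(sig) + len(ref)  # zero-pad so the circular correlation behaves linearly
    S = dft(list(sig) + [0.0] * (n - len(sig)))
    R = dft(list(ref) + [0.0] * (n - len(ref)))
    cross = [s * r.conjugate() for s, r in zip(S, R)]
    # PHAT weighting: discard magnitude and keep only the phase of each bin.
    phat = [c / abs(c) if abs(c) > 1e-12 else 0j for c in cross]
    cc = [v.real for v in dft(phat, inverse=True)]
    # Negative lags wrap around to the end of the circular correlation.
    lags = range(-max_lag, max_lag + 1)
    return max(lags, key=lambda lag: cc[lag % n])


# Hypothetical example: a noise burst reaching the second microphone 5 samples late.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(64)]
sig = [0.0] * 5 + ref
print(gcc_phat_delay(sig, ref, max_lag=10))  # prints 5
```

Dividing the returned lag by the sampling rate gives the TDOA in seconds. In a system like the one described, delays of this kind from several microphone pairs would presumably form the feature vector passed to the classifier, although the abstract does not specify the microphone layout or the feature construction.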

Acknowledgement

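This paper is the result of the Local Government-University Cooperation-Based Regional Innovation Project, carried out in 2021 with support from the National Research Foundation of Korea, funded by the Ministry of Education. (1345341781)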
