
A HMM-based Method of Reducing the Time for Processing Sound Commands in Computer Games

An HMM-based Method for Reducing Command Signal Processing Time in Computer Games

  • Park, Dosaeng (Computer Science and Engineering Major, Graduate School, Hankuk University of Foreign Studies) ;
  • Kim, Sangchul (Computer Science and Engineering Major, Graduate School, Hankuk University of Foreign Studies)
  • 박도생 (한국외국어대학교 컴퓨터 및 전자시스템 공학부) ;
  • 김상철 (한국외국어대학교 컴퓨터 및 전자시스템 공학부)
  • Received : 2016.03.10
  • Accepted : 2016.04.14
  • Published : 2016.04.20

Abstract

In computer games, the most common user-interface methods are keyboards, mice, and touch screens. The total time to process a sound command in a game is the sum of its input time and its recognition time. In this paper, we propose a method that takes only a prefix of the input signal for a sound command instead of the whole signal, which reduces the total processing time. In our method, command sounds are recognized using HMMs (Hidden Markov Models), with separate HMMs built for the whole input signals and for their prefix signals. We evaluate the proposed method on representative commands of platform games. The experiments show that the total processing time of input command signals is reduced without a significant decrease in the recognition rate. This study will contribute to enhancing the versatility of user interfaces for computer games.

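In computer games, most user-interface methods rely on the keyboard, mouse, or touch screen. The total processing time of a sound command consists mainly of the command input time and the recognition time. This paper proposes a method that receives only the leading part of a command signal rather than the whole signal, shortening the input time and thus the total processing time. Our method recognizes command signals using HMMs (Hidden Markov Models), building separate HMMs for the whole signals and for the partial signals. We tested the method on representative platform-game commands expressed as voice and hand-clap sounds. The results show that command processing time is reduced without a large drop in the recognition rate. This work will contribute to diversifying user-interface methods for games.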
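The idea of recognizing a command from only the prefix of its signal can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes discrete (quantized) feature symbols rather than real audio features, and the model parameters, the command names `jump` and `run`, and the helper names `log_likelihood`, `classify`, and `recognize_from_prefix` are all hypothetical. Each command has one hand-set HMM for whole signals and a separate one for prefixes, and the command whose model gives the highest forward-algorithm likelihood is chosen.

```python
import math

def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM."""
    if not obs:
        raise ValueError("observation sequence must not be empty")
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate state probabilities, then weight by the emission.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        log_p += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_p

def classify(obs, models):
    """Return the command whose HMM assigns obs the highest likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

# Hypothetical hand-set models: (start, transition, emission) over symbols 0..2.
# Whole-signal models are two-state left-to-right HMMs.
LEFT_TO_RIGHT = [[0.7, 0.3], [0.0, 1.0]]
WHOLE_MODELS = {
    "jump": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    "run": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.1, 0.8], [0.1, 0.8, 0.1]]),
}
# Separate, smaller models describe only the beginning of each command.
PREFIX_MODELS = {
    "jump": ([1.0], [[1.0]], [[0.8, 0.1, 0.1]]),
    "run": ([1.0], [[1.0]], [[0.1, 0.1, 0.8]]),
}

def recognize_from_prefix(obs, prefix_len):
    """Classify using only the first prefix_len symbols and the prefix models."""
    return classify(obs[:prefix_len], PREFIX_MODELS)

if __name__ == "__main__":
    signal = [0, 0, 0, 1, 1, 1]  # toy quantized feature sequence
    print(classify(signal, WHOLE_MODELS))
    print(recognize_from_prefix(signal, 3))
```

In this toy setting the prefix decision needs only the first three symbols, so input can stop early; whether the recognition rate holds up on real audio depends on how distinctive the command beginnings are.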
