DOI QR코드

DOI QR Code

Applying CEE (CrossEntropyError) to improve performance of Q-Learning algorithm

Q-learning 알고리즘이 성능 향상을 위한 CEE(CrossEntropyError)적용

  • Kang, Hyun-Gu (Department of Medical IT Marketing, Eulji University) ;
  • Seo, Dong-Sung (Department of Medical IT Marketing, Eulji University) ;
  • Lee, Byeong-seok (Department of Medical IT Marketing, Eulji University) ;
  • Kang, Min-Soo (Department of Medical IT Marketing, Eulji University)
  • Received : 2017.01.16
  • Accepted : 2017.06.20
  • Published : 2017.06.30

Abstract

Recently, the Q-Learning algorithm, which is one kind of reinforcement learning, is mainly used to implement artificial intelligence system in combination with deep learning. Many research is going on to improve the performance of Q-Learning. Therefore, purpose of theory try to improve the performance of Q-Learning algorithm. This Theory apply Cross Entropy Error to the loss function of Q-Learning algorithm. Since the mean squared error used in Q-Learning is difficult to measure the exact error rate, the Cross Entropy Error, known to be highly accurate, is applied to the loss function. Experimental results show that the success rate of the Mean Squared Error used in the existing reinforcement learning was about 12% and the Cross Entropy Error used in the deep learning was about 36%. The success rate was shown.

Keywords