Applying CEE (CrossEntropyError) to improve performance of Q-Learning algorithm

Kang, Hyun-Gu; Seo, Dong-Sung; Lee, Byeong-seok; Kang, Min-Soo

doi:10.24225/kjai.2017.5.1.1

Korean Journal of Artificial Intelligence (한국인공지능학회지)

Volume 5 Issue 1 / Pages 1-9 / 2017 / 2508-7894 (eISSN)

Korea Artificial Intelligence Association (한국인공지능학회)

Q-learning 알고리즘이 성능 향상을 위한 CEE(CrossEntropyError)적용

Kang, Hyun-Gu (Department of Medical IT Marketing, Eulji University) ;
Seo, Dong-Sung (Department of Medical IT Marketing, Eulji University) ;
Lee, Byeong-seok (Department of Medical IT Marketing, Eulji University) ;
Kang, Min-Soo (Department of Medical IT Marketing, Eulji University)

Received : 2017.01.16
Accepted : 2017.06.20
Published : 2017.06.30

https://doi.org/10.24225/kjai.2017.5.1.1

Abstract

Recently, Q-Learning, a reinforcement learning algorithm, has been widely used in combination with deep learning to implement artificial intelligence systems, and much research is under way to improve its performance. This study likewise aims to improve the performance of the Q-Learning algorithm by applying Cross Entropy Error (CEE) as its loss function. Because the Mean Squared Error (MSE) conventionally used in Q-Learning makes it difficult to measure the exact error rate, the Cross Entropy Error, which is known to be highly accurate, is applied to the loss function instead. Experimental results show a success rate of about 12% with the Mean Squared Error used in existing reinforcement learning and about 36% with the Cross Entropy Error used in deep learning.
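The abstract does not say how the Cross Entropy Error is computed from Q-values, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the predicted Q-values and the temporal-difference target are each passed through a softmax so that they can be compared as probability distributions. The function names, the discount factor gamma = 0.99, and all numbers are hypothetical and chosen for illustration.

```python
import math


def softmax(values):
    """Convert a list of scores into a probability distribution."""
    shift = max(values)  # subtract the max for numerical stability
    exps = [math.exp(v - shift) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def mean_squared_error(predicted, target):
    """Average squared difference between two equal-length vectors."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(predicted)


def cross_entropy_error(predicted_probs, target_probs, eps=1e-12):
    """Cross entropy -sum(t * log(p)); eps guards against log(0)."""
    return -sum(t * math.log(p + eps) for p, t in zip(predicted_probs, target_probs))


def td_target(q_values, action, reward, next_q_values, gamma=0.99):
    """Copy q_values and overwrite the taken action with r + gamma * max Q(s', .)."""
    target = list(q_values)
    target[action] = reward + gamma * max(next_q_values)
    return target


# Hypothetical numbers for one transition (not taken from the paper).
q_values = [0.2, 0.5, 0.1, 0.3]
next_q_values = [0.0, 0.4, 0.9, 0.1]
target = td_target(q_values, action=2, reward=1.0, next_q_values=next_q_values)

# Conventional Q-Learning loss: MSE between predicted Q-values and the target.
mse_loss = mean_squared_error(q_values, target)

# Assumed CEE variant: compare softmax-normalized predictions and targets.
cee_loss = cross_entropy_error(softmax(q_values), softmax(target))

print(f"MSE loss: {mse_loss:.4f}")
print(f"CEE loss: {cee_loss:.4f}")
```

Both losses are computed from the same transition, so swapping one for the other changes only the quantity being minimized, which is the comparison the abstract describes. Other ways of mapping Q-values to distributions are possible and may differ from what the paper used.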
