Reinforcement Learning using Propagation of Goal-State-Value

Kim, Byeong-Cheon;Yun, Byeong-Ju;

한국정보처리학회논문지 (The Transactions of the Korea Information Processing Society)

제6권5호
/
Pages.1303-1311
/
1999
/
1226-9190(pISSN)

한국정보처리학회 (Korea Information Processing Society)

목표상태 값 전파를 이용한 강화 학습

Reinforcement Learning using Propagation of Goal-State-Value

김병천 (명지대학교 대학원 컴퓨터공학과/정보통신교육연구센터) ;
윤병주 (명지대학교 컴퓨터공학과/정보통신교육연구센터)

발행 : 1999.05.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

In order to learn in dynamic environments, reinforcement learning algorithms like Q-learning, TD(0)-learning, TD(λ)-learning have been proposed. however, most of them have a drawback of very slow learning because the reinforcement value is given when they reach their goal state. In this thesis, we have proposed a reinforcement learning method that can approximate fast to the goal state in maze environments. The proposed reinforcement learning method is separated into global learning and local learning, and then it executes learning. Global learning is a learning that uses the replacing eligibility trace method to search the goal state. In local learning, it propagates the goal state value that has been searched through global learning to neighboring sates, and then searches goal state in neighboring states. we can show through experiments that the reinforcement learning method proposed in this thesis can find out an optimal solution faster than other reinforcement learning methods like Q-learning, TD(o)learning and TD(λ)-learning.

한국정보처리학회논문지 (The Transactions of the Korea Information Processing Society)

목표상태 값 전파를 이용한 강화 학습

Reinforcement Learning using Propagation of Goal-State-Value

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)