逆再生学習による追跡捕獲ロボットの行動獲得

¥660 JPY

セール売り切れ

税込

カテゴリ: 研究会(論文単位)

論文No: ST23035,CT23098

グループ名: 【C】電子・情報・システム部門システム/【C】電子・情報・システム部門制御合同研究会

発行日: 2023/11/29

タイトル(英語): Behavior Acquisition of Hunter Robot by Reverse Replay Learning

著者名: 川崎凌(大阪工業大学),住田尚也(大阪工業大学),重弘裕二(大阪工業大学)

著者名(英語): Ryo Kawasaki(Graduate School of Engineering, Osaka Institute of Technology),Naoya Sumida(Graduate School of Engineering, Osaka Institute of Technology),Yuji Shigehiro(Osaka Institute of Technology)

要約(日本語): 3体の捕獲者が1体の逃走者を追跡捕獲する逃走者追跡捕獲問題において、強化学習を用いて捕獲エージェントの行動原理を学習させる。しかし、通常の強化学習では、捕獲成功時に伴う報酬を十分に獲得することができないため、学習が難しい。そこで、捕獲成功時をゴールとして、ゴールからスタートへと逆向きに各エージェントを行動させて学習を行うことで、十分な報酬を獲得させる逆再生学習を提案する。

要約(英語): In the fugitive tracking and capture problem, reinforcement learning is used to learn the behavioral principles of the capture agent. However, with the usual method, learning is difficult because the reward for successful capture is not sufficient. Therefore, we propose a reverse replay learning method in which successful capture is the goal and each agent acts in reverse order from the goal to the start to obtain sufficient reward.

本誌: 2023年12月2日-2023年12月3日システム/制御合同研究会

本誌掲載ページ: 29-33 p

原稿種別: 日本語

PDFファイルサイズ: 1,034 Kバイト

販売タイプ冊子印刷（一般価格660円/会員価格440円） PDFダウンロード（一般価格330円/会員価格220円）

書籍サイズ A4

ページ数 5

数量

詳細を表示する

国/地域

逆再生学習による追跡捕獲ロボットの行動獲得

逆再生学習による追跡捕獲ロボットの行動獲得