強化学習における学習時の初期配置が与える影響

¥660 JPY

セール売り切れ

税込

カテゴリ: 研究会(論文単位)

論文No: ST23036,CT23099

グループ名: 【C】電子・情報・システム部門システム/【C】電子・情報・システム部門制御合同研究会

発行日: 2023/11/29

タイトル(英語): Impact of initial placement in reinforcement learning

著者名: 住田尚也(大阪工業大学),川崎凌(大阪工業大学),重弘裕二(大阪工業大学)

著者名(英語): Naoya Sumida(Graduate School of Engineering, Osaka Institute of Technology),Ryo Kawasaki(Graduate School of Engineering, Osaka Institute of Technology),Yuji Shigehiro(Osaka Institute of Technology)

要約(日本語): 3体の捕獲者が1体の逃走者を追跡捕獲する逃走者捕獲問題において、強化学習を用いて捕獲エージェントの行動原理を学習させる。しかし、通常の強化学習では、捕獲成功時に伴う報酬を十分に獲得することができないため、学習が難しい。そこで、捕獲成功時に近い配置を初期配置とし、各エージェントを行動させて学習を行うことで、より多くの報酬を獲得させる手法を提案する。_x000D_

要約(英語): In the fugitive capture problem, in which three captors track and capture a single fugitive, reinforcement learning is used to learn the behavioral principles of the capturing agent. However, learning is difficult with ordinary reinforcement learning because the rewards associated with successful capture are not sufficiently acquired. Therefore, we propose a method to obtain more rewards by making each agent act and learn in an initial configuration that is close to the successful capture configuration._x000D_

本誌: 2023年12月2日-2023年12月3日システム/制御合同研究会

本誌掲載ページ: 35-39 p

原稿種別: 日本語

PDFファイルサイズ: 1,073 Kバイト

販売タイプ冊子印刷（一般価格660円/会員価格440円） PDFダウンロード（一般価格330円/会員価格220円）

書籍サイズ A4

ページ数 5

数量

詳細を表示する

国/地域

強化学習における学習時の初期配置が与える影響

強化学習における学習時の初期配置が与える影響