外部による評価を報酬に組み入れる繰り返し動作の獲得手法の一検討

¥330 JPY

セール売り切れ

税込

カテゴリ: 研究会(論文単位)

論文No: ST15030

グループ名: 【C】電子・情報・システム部門システム研究会

発行日: 2015/12/06

タイトル(英語): A study on cyclic behavior acquisition including outside evaluation in reward

著者名: NGUYEN VAN BAC(筑波大学),澁谷長史(筑波大学)

著者名(英語): BAC NGUYEN VAN(University of Tsukuba),Takeshi Shibuya(University of Tsukuba)

要約(日本語): 従来の強化学習は目標が時刻によって変化する繰り返し動作を獲得することが難しい。周期報酬環境下の強化学習は繰り返し動作の獲得が容易である。しかし、外部による評価に適した動作の獲得に適用する場合、外部の評価への適応と繰り返し動作の獲得のトレードオフにより適切な報酬関数を設定することが困難である。本論文は外部の評価に適した繰り返し動作を獲得するための手法を提案し，その有効性を確認する。

要約(英語): In the framework of conventional reinforcement learning, it is difficult to acquire cyclic behavior. The periodic reinforcement learning is more suitable for cyclic task. However, in order to acquire the cyclic behavior adapted to outside evaluation, it is difficult to set proper reward function due to the trade-off between requisition to adapt to outside evaluation and cyclic behavior acquisition. This paper proposes a method for acquiring the cyclic behavior adapted to outside evaluation.

原稿種別: 日本語

PDFファイルサイズ: 892 Kバイト

販売タイプ PDFダウンロード（一般価格330円/会員価格220円）

書籍サイズ A4

ページ数 5

数量

詳細を表示する

国/地域

外部による評価を報酬に組み入れる繰り返し動作の獲得手法の一検討

外部による評価を報酬に組み入れる繰り返し動作の獲得手法の一検討