強化学習システムにおける学習周期の無作為抽出による適応

¥330 JPY

セール売り切れ

税込

カテゴリ: 研究会(論文単位)

論文No: ST18084

グループ名: 【C】電子・情報・システム部門システム研究会

発行日: 2018/09/26

タイトル(英語): Adaptation by Random Sampling of Learning Cycle in Reinforcement Learning System

著者名: 野津亮(大阪府立大学),安田功嗣(大阪府立大学),生方誠希(大阪府立大学),本多克宏(大阪府立大学)

著者名(英語): Akira Notsu(Osaka Prefecture University),Koji Yasuda(Osaka Prefecture University),Seiki Ubukata(Osaka Prefecture University),Katsuhiro Honda(Osaka Prefecture University)

要約(日本語): 強化学習システムでは，設計者が設定する学習周期にその学習の効率が大きく左右される．学習周期が短いと状態数が爆発的に増えてしまい学習が終わらず，逆に，学習周期が長いと大雑把な学習しかできない．本研究では，うまく学習できない，あるいは学習速度が遅い場合，学習周期をランダムに変更することでシステムを適応させるための枠組みを提案する．数値実験によってその有効性を示す．

要約(英語): In the reinforcement learning system, the learning efficiency largely depends on the learning cycle set by the designer. If the learning cycle is short, the number of states explosively increases and the learning is not ended. Conversely, if the learning cycle is long, only a rough learning can be done. In this research, we propose a framework for adapting the system by randomly changing the learning cycle when it can not be learned well or the learning speed is slow. It shows its effectiveness by numerical experiment.

原稿種別: 日本語

PDFファイルサイズ: 2,454 Kバイト

販売タイプ PDFダウンロード（一般価格330円/会員価格220円）

書籍サイズ A4

ページ数 4

数量

詳細を表示する

国/地域

強化学習システムにおける学習周期の無作為抽出による適応

強化学習システムにおける学習周期の無作為抽出による適応