Safety Critical Systemに対する強化学習適用に関する検討(コンカレント工学一般、及び、CSTソリューションコンペティション2007発表会)

藤原 真一; 宮本 俊幸; 熊谷 貞俊

講演名	2008-01-29 Safety Critical Systemに対する強化学習適用に関する検討(コンカレント工学一般、及び、CSTソリューションコンペティション2007発表会) 藤原真一, 宮本俊幸, 熊谷貞俊,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	マルチエージェント環境における,自律的な協調行動獲得の枠組みとして強化学習が注目されている.強化学習は試行錯誤や知識利用による行動の結果得られる報酬を手掛かりとして環境に適応する学習制御の枠組みである.本報告では,既に提案されているSafety Critical System (SCS)の障害抑制機構に対し,強化学習の適用可能性について検討する.SCSでは,システムをいくつかのセクションに分割し,発生する障害をあるレベル以下に抑制することでシステムの動作を保証できると想定しており,本研究では,障害抑制機構の意思決定に強化学習を用いる.SCSは,各セクションで求められるレベルに応じて,障害の抑制に差をつけるという点でQuality of Serviceの概念と類似しており,既存のマルチエージェント強化学習のテストベッドとは異なるものである.強化学習法として,元来シングルエージェント環境を対象に開発されたQ-Learningを用いたシミュレーションの結果,強化学習が適用可能であると判断したので報告する.
抄録(英)	Reinforcement Learning have been attracting many researchers attention as a framework of autonomous cooperative behavior acquisition. In this framework, actor adapts to an environment with the help of a reward obtained as a result of the trial-and-error or application of self knowledge. Now, we discuss a possibility to apply a multi-agent reinforcement learning to already proposed damage control mechanism of Safety Critical System(SCS). In the SCS, we suppose the total system is divided into some sections and performance is secured by controlling damages under a certain level. Then we employ a reinforcement learning method for a decision-making of damage control mechanism. SCS is different from existing other testbed of multi-agent reinforcement learning because it is similar to the concept of the Quality of Service in that we control damages differently according to each required damage level. According to a simulation the Q-learning is employed as a reinforcement learning method, we regarded the reinforcement learning is possible to apply to the SCS.
キーワード(和)	マルチエージェントシステム / 強化学習 / Q学習 / Safety Critical System
キーワード(英)	Multi-agent System / Reinforcement Learning / Q-learning / Safety Critical System
資料番号	CST2007-53
発行日

研究会情報
研究会	CST
開催期間	2008/1/22(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Concurrent System Technology (CST)
本文の言語	JPN
タイトル（和）	Safety Critical Systemに対する強化学習適用に関する検討(コンカレント工学一般、及び、CSTソリューションコンペティション2007発表会)
サブタイトル（和）
タイトル（英）	A Study on Applying Reinforcement Learning to a Safety Critical System
サブタイトル（和）
キーワード(1)（和/英）	マルチエージェントシステム / Multi-agent System
キーワード(2)（和/英）	強化学習 / Reinforcement Learning
キーワード(3)（和/英）	Q学習 / Q-learning
キーワード(4)（和/英）	Safety Critical System / Safety Critical System
第 1 著者氏名（和/英）	藤原真一 / Shinichi FUJIWARA
第 1 著者所属（和/英）	大阪大学大学院工学研究科電気電子情報工学専攻 Division of Electrical, Electronic and Information Eng., Graduate School of Eng., Osaka University
第 2 著者氏名（和/英）	宮本俊幸 / Toshiyuki MIYAMOTO
第 2 著者所属（和/英）	大阪大学大学院工学研究科電気電子情報工学専攻 Division of Electrical, Electronic and Information Eng., Graduate School of Eng., Osaka University
第 3 著者氏名（和/英）	熊谷貞俊 / Sadatoshi KUMAGAI
第 3 著者所属（和/英）	大阪大学大学院工学研究科電気電子情報工学専攻 Division of Electrical, Electronic and Information Eng., Graduate School of Eng., Osaka University
発表年月日	2008-01-29
資料番号	CST2007-53
巻番号（vol）	vol.107
号番号（no）	472
ページ範囲	pp.-
ページ数	6
発行日