Paper Abstract and Keywords |
Presentation |
2009-01-29 17:00
Reinforcement Learning of Optimal Supervisor based on the Worst-Case Behavior Kouji Kajiwara, Tatsushi Yamasaki (Setsunan Univ.) CST2008-49 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Ramadge and Wonham proposed the supervisory control, which is a framework for logical control of discrete event systems. However, in the ordinary supervisory control, the costs for occurence and disabling of events have not been considered.
This paper proposes a synthesis method of the supervisor based on the worst-case behavior of discrete event systems. We introduce the new value functions for the assigned control patterns.
The new value functions are not based on the expected total rewards, but based on the most undesirable event ocurrence in the assigned control pattern.
In the proposed method, the supervisor learns how to assign the control pattern based on reinforcement learning so as to maximize the value functions.
We show the efficiency of the proposed method by computer simulation. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Discrete event systems / supervisory control / reinforcement learning / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 108, Jan. 2009. |
Paper # |
|
Date of Issue |
2009-01-22 (CST) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
CST2008-49 |
Conference Information |
Committee |
MSS |
Conference Date |
2009-01-29 - 2009-01-30 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Kanagawa Industrial Promotion Center |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Concurrent Systems |
Paper Information |
Registration To |
MSS |
Conference Code |
2009-01-CST |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Reinforcement Learning of Optimal Supervisor based on the Worst-Case Behavior |
Sub Title (in English) |
|
Keyword(1) |
Discrete event systems |
Keyword(2) |
supervisory control |
Keyword(3) |
reinforcement learning |
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Kouji Kajiwara |
1st Author's Affiliation |
Setsunan University (Setsunan Univ.) |
2nd Author's Name |
Tatsushi Yamasaki |
2nd Author's Affiliation |
Setsunan University (Setsunan Univ.) |
3rd Author's Name |
|
3rd Author's Affiliation |
() |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2009-01-29 17:00:00 |
Presentation Time |
25 minutes |
Registration for |
MSS |
Paper # |
CST2008-49 |
Volume (vol) |
vol.108 |
Number (no) |
no.415 |
Page |
pp.45-50 |
#Pages |
6 |
Date of Issue |
2009-01-22 (CST) |