Presentation | 2015-06-23 Optimal Algorithms in Dueling Bandit Problem Junpei Komiyama, Junya Honda, Hisashi Kashima, Hiroshi Nakagawa, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We study the K-armed dueling bandit problem, a variation of the standard stochastic bandit problem where the feedback is limited to relative comparisons of a pair of arms. Algorithms that are inspired by the Deterministic Minimum Empirical Divergence algorithm (Honda and Takemura, 2010) are proposed. The effectiveness of the proposed algorithms are assessed both theoretically and empirically. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | multi-armed bandit problem / dueling bandit problem / online learning / preference elicitation |
Paper # | IBISML2015-14 |
Date of Issue | 2015-06-16 (IBISML) |
Conference Information | |
Committee | NC / IPSJ-BIO / IBISML / IPSJ-MPS |
---|---|
Conference Date | 2015/6/23(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Okinawa Institute of Science and Technology |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Machine Learning Approach to Biodata Mining, and General |
Chair | Toshimichi Saito(Hosei Univ.) / Masakazu Sekijima(東工大) / Takashi Washio(Osaka Univ.) / Hayaru Shouno(電通大) |
Vice Chair | Shigeo Sato(Tohoku Univ.) / / Kenji Fukumizu(ISM) / Masashi Sugiyama(Tokyo Inst. of Tech.) |
Secretary | Shigeo Sato(Kyushu Inst. of Tech.) / (Kyoto Sangyo Univ.) / Kenji Fukumizu(京大) / Masashi Sugiyama(お茶の水女子大) / (OIST) |
Assistant | Hiroyuki Kanbara(Tokyo Inst. of Tech.) / Hisanao Akima(Tohoku Univ.) / / Koji Tsuda(Univ. of Tokyo) / Hisashi Kashima(Kyoto Univ.) |
Paper Information | |
Registration To | Technical Committee on Neurocomputing / Special Interest Group on Bioinformatics and Genomics / Technical Committee on Infomation-Based Induction Sciences and Machine Learning / Special Interest Group on Mathematical Modeling and Problem Solving |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Optimal Algorithms in Dueling Bandit Problem |
Sub Title (in English) | * |
Keyword(1) | multi-armed bandit problem |
Keyword(2) | dueling bandit problem |
Keyword(3) | online learning |
Keyword(4) | preference elicitation |
1st Author's Name | Junpei Komiyama |
1st Author's Affiliation | The University of Tokyo(U-Tokyo) |
2nd Author's Name | Junya Honda |
2nd Author's Affiliation | The University of Tokyo(U-Tokyo) |
3rd Author's Name | Hisashi Kashima |
3rd Author's Affiliation | Kyoto University(Kyoto University) |
4th Author's Name | Hiroshi Nakagawa |
4th Author's Affiliation | The University of Tokyo(U-Tokyo) |
Date | 2015-06-23 |
Paper # | IBISML2015-14 |
Volume (vol) | vol.115 |
Number (no) | IBISML-112 |
Page | pp.pp.87-94(IBISML), |
#Pages | 8 |
Date of Issue | 2015-06-16 (IBISML) |