Presentation | 2017-12-17 Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model Mengxi Lin, Nakamasa Inoue, Koichi Shinoda, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Action sequence recognition aims to recognize what actions occur in a video and their temporal order. In this paper, we propose to combine an LSTM network trained with Connectionist Temporal Classification (CTC) with a statistical language model for action sequence recognition. The statistical language model captures the relations between action instances, which are hardly learned by the CTC network. Our experiments on the Breakfast dataset show that the statistical language model can significantly boost the recognition accuracy of the CTC network, from 37.0% to 43.4%. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | connectionist temporal classification / action sequence recognition / statistical language model / weakly supervised learning |
Paper # | PRMU2017-101 |
Date of Issue | 2017-12-10 (PRMU) |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2017/12/16(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Shinichi Sato(NII) |
Vice Chair | Hironobu Fujiyoshi(Chubu Univ.) / Yoshihisa Ijiri(Omron) |
Secretary | Hironobu Fujiyoshi(AIST) / Yoshihisa Ijiri(NAIST) |
Assistant | Masato Ishii(NEC) / Yusuke Sugano(Osaka Univ.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model |
Sub Title (in English) | |
Keyword(1) | connectionist temporal classification |
Keyword(2) | action sequence recognition |
Keyword(3) | statistical language model |
Keyword(4) | weakly supervised learning |
1st Author's Name | Mengxi Lin |
1st Author's Affiliation | Tokyo Institute of Technology(Tokyo Tech) |
2nd Author's Name | Nakamasa Inoue |
2nd Author's Affiliation | Tokyo Institute of Technology(Tokyo Tech) |
3rd Author's Name | Koichi Shinoda |
3rd Author's Affiliation | Tokyo Institute of Technology(Tokyo Tech) |
Date | 2017-12-17 |
Paper # | PRMU2017-101 |
Volume (vol) | vol.117 |
Number (no) | PRMU-362 |
Page | pp.pp.1-6(PRMU), |
#Pages | 6 |
Date of Issue | 2017-12-10 (PRMU) |