Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model

Presentation	2017-12-17 Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model Mengxi Lin, Nakamasa Inoue, Koichi Shinoda,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Action sequence recognition aims to recognize what actions occur in a video and their temporal order. In this paper, we propose to combine an LSTM network trained with Connectionist Temporal Classification (CTC) with a statistical language model for action sequence recognition. The statistical language model captures the relations between action instances, which are hardly learned by the CTC network. Our experiments on the Breakfast dataset show that the statistical language model can significantly boost the recognition accuracy of the CTC network, from 37.0% to 43.4%.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	connectionist temporal classification / action sequence recognition / statistical language model / weakly supervised learning
Paper #	PRMU2017-101
Date of Issue	2017-12-10 (PRMU)

Conference Information
Committee	PRMU
Conference Date	2017/12/16(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Shinichi Sato(NII)
Vice Chair	Hironobu Fujiyoshi(Chubu Univ.) / Yoshihisa Ijiri(Omron)
Secretary	Hironobu Fujiyoshi(AIST) / Yoshihisa Ijiri(NAIST)
Assistant	Masato Ishii(NEC) / Yusuke Sugano(Osaka Univ.)

Paper Information
Registration To	Technical Committee on Pattern Recognition and Media Understanding
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model
Sub Title (in English)
Keyword(1)	connectionist temporal classification
Keyword(2)	action sequence recognition
Keyword(3)	statistical language model
Keyword(4)	weakly supervised learning
1st Author's Name	Mengxi Lin
1st Author's Affiliation	Tokyo Institute of Technology(Tokyo Tech)
2nd Author's Name	Nakamasa Inoue
2nd Author's Affiliation	Tokyo Institute of Technology(Tokyo Tech)
3rd Author's Name	Koichi Shinoda
3rd Author's Affiliation	Tokyo Institute of Technology(Tokyo Tech)
Date	2017-12-17
Paper #	PRMU2017-101
Volume (vol)	vol.117
Number (no)	PRMU-362
Page	pp.pp.1-6(PRMU),
#Pages	6
Date of Issue	2017-12-10 (PRMU)