生放送番組向けの自動解説音声の挿入タイミング決定法

Presentation	2018-10-28 Timing determination method to insert an automated audio description in live television broadcast Manon Ichiki, Tadashi Kumano, Atsushi Imai, Tohru Takagi,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	We are conducting research on "automated audio description (ADD)" which automatically generates audio descriptions for visually impaired people to enjoy live TV programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other's comment. It is necessary, therefore, to avoid their overlaps to understand both television commentary and ADD. In this paper, we propose timing determination method to insert ADDs into live sports programs. The method predicts the end of utterance of every live commentary by announcer and/or commentator, and ADDs can be inserted after live commentaries. In this method, difference between long and short term moving average of fundamental frequency (F0) extracted every 5ms is adopted to predict end of utterances. The effectiveness of proposed method was shown by comparing predicted and manually determined timing from live sports commentaries.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Audio Description / Visually Impaired People / The End of Utterance / Fundamental frequency / Moving Average
Paper #	SP2018-41,WIT2018-29
Date of Issue	2018-10-20 (SP, WIT)

Conference Information
Committee	WIT / SP
Conference Date	2018/10/27(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Kyushu Institute of Technology(Kitakyushu)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Chikamune Wada(Kyushu Inst. of Tech.) / Yoichi Yamashita(Ritsumeikan Univ.)
Vice Chair	Daisuke Wakatsuki(Tsukuba Univ. of Tech.) / Akinobu Ri(Nagoya Inst. of Tech.)
Secretary	Daisuke Wakatsuki(AIST) / Akinobu Ri(Nagoya Inst. of Tech.)
Assistant	Manabi Miyagi(Tsukuba Univ. of Tech.) / Takeaki Shionome(Teikyo Univ.) / Takashi Handa(Saitama Industrial Tech. Center) / Tomoki Koriyama(Tokyo Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To	Technical Committee on Well-being Information Technology / Technical Committee on Speech
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Timing determination method to insert an automated audio description in live television broadcast
Sub Title (in English)	*
Keyword(1)	Audio Description
Keyword(2)	Visually Impaired People
Keyword(3)	The End of Utterance
Keyword(4)	Fundamental frequency
Keyword(5)	Moving Average
1st Author's Name	Manon Ichiki
1st Author's Affiliation	NHK Science&Technology Research Laboratories(NHK)
2nd Author's Name	Tadashi Kumano
2nd Author's Affiliation	NHK Science&Technology Research Laboratories(NHK)
3rd Author's Name	Atsushi Imai
3rd Author's Affiliation	NHK Science&Technology Research Laboratories(NHK)
4th Author's Name	Tohru Takagi
4th Author's Affiliation	NHK Engineering Systems(NHK-ES)
Date	2018-10-28
Paper #	SP2018-41,WIT2018-29
Volume (vol)	vol.118
Number (no)	SP-269,WIT-270
Page	pp.pp.45-50(SP), pp.45-50(WIT),
#Pages	6
Date of Issue	2018-10-20 (SP, WIT)