Presentation 2018-10-28
Timing determination method to insert an automated audio description in live television broadcast
Manon Ichiki, Tadashi Kumano, Atsushi Imai, Tohru Takagi,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We are conducting research on "automated audio description (ADD)" which automatically generates audio descriptions for visually impaired people to enjoy live TV programs. However, there is a problem that AAD overlaps with the live television commentary voice, making it difficult to hear each other's comment. It is necessary, therefore, to avoid their overlaps to understand both television commentary and ADD. In this paper, we propose timing determination method to insert ADDs into live sports programs. The method predicts the end of utterance of every live commentary by announcer and/or commentator, and ADDs can be inserted after live commentaries. In this method, difference between long and short term moving average of fundamental frequency (F0) extracted every 5ms is adopted to predict end of utterances. The effectiveness of proposed method was shown by comparing predicted and manually determined timing from live sports commentaries.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Audio Description / Visually Impaired People / The End of Utterance / Fundamental frequency / Moving Average
Paper # SP2018-41,WIT2018-29
Date of Issue 2018-10-20 (SP, WIT)

Conference Information
Committee WIT / SP
Conference Date 2018/10/27(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Kyushu Institute of Technology(Kitakyushu)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Chikamune Wada(Kyushu Inst. of Tech.) / Yoichi Yamashita(Ritsumeikan Univ.)
Vice Chair Daisuke Wakatsuki(Tsukuba Univ. of Tech.) / Akinobu Ri(Nagoya Inst. of Tech.)
Secretary Daisuke Wakatsuki(AIST) / Akinobu Ri(Nagoya Inst. of Tech.)
Assistant Manabi Miyagi(Tsukuba Univ. of Tech.) / Takeaki Shionome(Teikyo Univ.) / Takashi Handa(Saitama Industrial Tech. Center) / Tomoki Koriyama(Tokyo Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To Technical Committee on Well-being Information Technology / Technical Committee on Speech
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Timing determination method to insert an automated audio description in live television broadcast
Sub Title (in English) *
Keyword(1) Audio Description
Keyword(2) Visually Impaired People
Keyword(3) The End of Utterance
Keyword(4) Fundamental frequency
Keyword(5) Moving Average
1st Author's Name Manon Ichiki
1st Author's Affiliation NHK Science&Technology Research Laboratories(NHK)
2nd Author's Name Tadashi Kumano
2nd Author's Affiliation NHK Science&Technology Research Laboratories(NHK)
3rd Author's Name Atsushi Imai
3rd Author's Affiliation NHK Science&Technology Research Laboratories(NHK)
4th Author's Name Tohru Takagi
4th Author's Affiliation NHK Engineering Systems(NHK-ES)
Date 2018-10-28
Paper # SP2018-41,WIT2018-29
Volume (vol) vol.118
Number (no) SP-269,WIT-270
Page pp.pp.45-50(SP), pp.45-50(WIT),
#Pages 6
Date of Issue 2018-10-20 (SP, WIT)