Presentation | 2018-10-28 Timing determination method to insert an automated audio description in live television broadcast Manon Ichiki, Tadashi Kumano, Atsushi Imai, Tohru Takagi |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We are conducting research on "automated audio description (AAD)," which automatically generates audio descriptions so that visually impaired people can enjoy live TV programs. However, AAD overlaps with the voice of the live television commentary, making both difficult to hear. These overlaps must therefore be avoided so that listeners can understand both the commentary and the AAD. In this paper, we propose a timing determination method for inserting AAD into live sports programs. The method predicts the end of each utterance by the announcer and/or commentator so that AAD can be inserted after the live commentary. To predict the end of an utterance, the method uses the difference between the long-term and short-term moving averages of the fundamental frequency (F0), which is extracted every 5 ms. The effectiveness of the proposed method was shown by comparing the predicted timing with manually determined timing for live sports commentaries. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Audio Description / Visually Impaired People / The End of Utterance / Fundamental frequency / Moving Average |
Paper # | SP2018-41,WIT2018-29 |
Date of Issue | 2018-10-20 (SP, WIT) |
Conference Information | |
Committee | WIT / SP |
---|---|
Conference Date | 2018/10/27(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Kyushu Institute of Technology(Kitakyushu) |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Chikamune Wada(Kyushu Inst. of Tech.) / Yoichi Yamashita(Ritsumeikan Univ.) |
Vice Chair | Daisuke Wakatsuki(Tsukuba Univ. of Tech.) / Akinobu Ri(Nagoya Inst. of Tech.) |
Secretary | Daisuke Wakatsuki(AIST) / Akinobu Ri(Nagoya Inst. of Tech.) |
Assistant | Manabi Miyagi(Tsukuba Univ. of Tech.) / Takeaki Shionome(Teikyo Univ.) / Takashi Handa(Saitama Industrial Tech. Center) / Tomoki Koriyama(Tokyo Inst. of Tech.) / Satoshi Kobashikawa(NTT) |
Paper Information | |
Registration To | Technical Committee on Well-being Information Technology / Technical Committee on Speech |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Timing determination method to insert an automated audio description in live television broadcast |
Sub Title (in English) | * |
Keyword(1) | Audio Description |
Keyword(2) | Visually Impaired People |
Keyword(3) | The End of Utterance |
Keyword(4) | Fundamental frequency |
Keyword(5) | Moving Average |
1st Author's Name | Manon Ichiki |
1st Author's Affiliation | NHK Science&Technology Research Laboratories(NHK) |
2nd Author's Name | Tadashi Kumano |
2nd Author's Affiliation | NHK Science&Technology Research Laboratories(NHK) |
3rd Author's Name | Atsushi Imai |
3rd Author's Affiliation | NHK Science&Technology Research Laboratories(NHK) |
4th Author's Name | Tohru Takagi |
4th Author's Affiliation | NHK Engineering Systems(NHK-ES) |
Date | 2018-10-28 |
Paper # | SP2018-41,WIT2018-29 |
Volume (vol) | vol.118 |
Number (no) | SP-269,WIT-270 |
Page | pp.45-50(SP), pp.45-50(WIT) |
#Pages | 6 |
Date of Issue | 2018-10-20 (SP, WIT) |
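
The abstract describes the core signal: the difference between a long-term and a short-term moving average of F0, with F0 extracted every 5 ms. The following is a minimal sketch of that idea, not the authors' implementation. The window lengths (`short_ms`, `long_ms`), the threshold (`threshold_hz`), the sign convention (short minus long), and the "first crossing below a threshold" decision rule are all assumptions made for illustration; the abstract does not specify them. The sketch also assumes the F0 contour has already been cleaned so that every frame carries a valid value in Hz (unvoiced frames interpolated or removed).

```python
from collections import deque

FRAME_PERIOD_MS = 5  # F0 is extracted every 5 ms, per the abstract


def moving_average(values, window):
    """Trailing moving average; early frames average over what is available."""
    out = []
    buf = deque()
    total = 0.0
    for v in values:
        buf.append(v)
        total += v
        if len(buf) > window:
            total -= buf.popleft()
        out.append(total / len(buf))
    return out


def f0_average_difference(f0_hz, short_ms=100, long_ms=500):
    """Short-term minus long-term moving average of an F0 contour (Hz).

    short_ms and long_ms are hypothetical window lengths, not values
    taken from the paper.
    """
    short_n = max(1, short_ms // FRAME_PERIOD_MS)
    long_n = max(1, long_ms // FRAME_PERIOD_MS)
    short_avg = moving_average(f0_hz, short_n)
    long_avg = moving_average(f0_hz, long_n)
    return [s - l for s, l in zip(short_avg, long_avg)]


def candidate_end_frames(f0_hz, threshold_hz=-10.0, **windows):
    """Frame indices where the difference first drops below threshold_hz.

    The thresholding rule is an assumed decision rule for illustration:
    a falling pitch pulls the short-term average below the long-term one.
    """
    diff = f0_average_difference(f0_hz, **windows)
    hits = []
    below = False
    for i, d in enumerate(diff):
        if d < threshold_hz and not below:
            hits.append(i)  # record only the downward crossing
        below = d < threshold_hz
    return hits
```

With the default parameters, a synthetic contour that stays flat at 200 Hz for one second and then falls steadily to 120 Hz over half a second yields a single candidate frame inside the falling segment. Multiplying a frame index by `FRAME_PERIOD_MS` converts it to milliseconds. A trailing (causal) average is used here because a live broadcast cannot look ahead; that is a design choice of this sketch rather than something the abstract states.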