Presentation 2020-01-27
Suppression of Dialog System Speech by Embedding Marker Signal into High Frequency Band
Shunsuke Saga, Akinori Ito
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Spoken dialog systems such as smart speakers have become popular in home environments. When two or more smart speakers share the same environment, one dialog system may misdetect another dialog system's voice as a user's voice. This paper proposes a method for muting synthesized speech so that a speech recognizer does not recognize speech uttered by a machine. An audio watermark indicates that the speech was uttered by a machine, and the speech recognizer attenuates the observed speech if it contains the watermark. The watermark is embedded in a high-frequency band so that it is not perceived by humans and can be robustly extracted. Experimental results show that the proposed method robustly determines whether the watermark is present.
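The abstract does not state the form of the marker signal, its frequency, or the detection rule. The following is therefore only a minimal sketch of the general idea, not the authors' method. It assumes a single low-level sinusoidal marker at a hypothetical 7.5 kHz in 16 kHz audio, a Goertzel-based energy-ratio detector, and a hypothetical threshold; every function name and constant is illustrative.

```python
import math

FS = 16000          # sampling rate in Hz (assumed for illustration)
F_MARKER = 7500.0   # marker frequency in Hz (hypothetical, not from the paper)
THRESHOLD = 1e-3    # detection threshold on the energy ratio (hypothetical)


def embed_marker(signal, fs, f_marker, amplitude):
    """Add a low-level sinusoidal marker at f_marker Hz to the signal."""
    return [x + amplitude * math.sin(2.0 * math.pi * f_marker * n / fs)
            for n, x in enumerate(signal)]


def goertzel_power(signal, fs, f):
    """Squared DFT magnitude of the signal at frequency f (Goertzel algorithm)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * f / fs)
    s1 = s2 = 0.0
    for x in signal:
        s1, s2 = x + coeff * s1 - s2, s1
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def marker_ratio(signal, fs, f_marker):
    """Approximate fraction of the signal energy located at f_marker."""
    energy = sum(x * x for x in signal)
    if energy == 0.0:
        return 0.0
    return 2.0 * goertzel_power(signal, fs, f_marker) / (len(signal) * energy)


def suppress_if_marked(signal, fs, f_marker, threshold, gain=0.0):
    """Attenuate the frame by `gain` if the marker is detected, else pass it through."""
    if marker_ratio(signal, fs, f_marker) > threshold:
        return [gain * x for x in signal]
    return list(signal)


# Stand-in for synthesized speech: two low-frequency tones, 0.1 s long.
N = 1600
speech = [0.5 * math.sin(2.0 * math.pi * 200.0 * n / FS)
          + 0.3 * math.sin(2.0 * math.pi * 900.0 * n / FS) for n in range(N)]
marked = embed_marker(speech, FS, F_MARKER, amplitude=0.05)

print("marker detected in marked speech:  ",
      marker_ratio(marked, FS, F_MARKER) > THRESHOLD)
print("marker detected in unmarked speech:",
      marker_ratio(speech, FS, F_MARKER) > THRESHOLD)
```

In this sketch the recognizer front end would call `suppress_if_marked` on each incoming frame before recognition. A real system would also need to cope with loudspeaker and microphone responses, room reverberation, and noise, none of which is modeled here.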
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech recognition / spoken dialog system / audio watermarking
Paper # EMM2019-94
Date of Issue 2020-01-20 (EMM)

Conference Information
Committee EMM
Conference Date 2020/1/27(1 day)
Place (in Japanese) (See Japanese page)
Place (in English) Tohoku Univ.
Topics (in Japanese) (See Japanese page)
Topics (in English) Sense of Presence, Universal Media, Digital Entertainment, etc.
Chair Masaki Kawamura(Yamaguchi Univ.)
Vice Chair Motoi Iwata(Osaka Prefecture Univ.) / Tetsuya Kojima(NIT,Tokyo College)
Secretary Motoi Iwata(NIT, Nagano College) / Tetsuya Kojima(Nagase)
Assistant Masaki Inamura(Tokyo Denki Univ.) / Kazuhiro Kono(Kansai Univ.)

Paper Information
Registration To Technical Committee on Enriched MultiMedia
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Suppression of Dialog System Speech by Embedding Marker Signal into High Frequency Band
Sub Title (in English)
Keyword(1) speech recognition
Keyword(2) spoken dialog system
Keyword(3) audio watermarking
1st Author's Name Shunsuke Saga
1st Author's Affiliation Tohoku University(Tohoku Univ.)
2nd Author's Name Akinori Ito
2nd Author's Affiliation Tohoku University(Tohoku Univ.)
Date 2020-01-27
Paper # EMM2019-94
Volume (vol) vol.119
Number (no) EMM-396
Page pp.1-6(EMM)
#Pages 6
Date of Issue 2020-01-20 (EMM)