Suppression of Dialog System Speech by Embedding Marker Signal into High Frequency Band

Presentation	2020-01-27 Suppression of Dialog System Speech by Embedding Marker Signal into High Frequency Band Shunsuke Saga, Akinori Ito
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Spoken dialog systems such as smart speakers have become popular in home environments. When two or more smart speakers share the same environment, one dialog system can mistake another system's synthesized voice for a user's voice. This paper proposes a method for muting synthesized speech so that a speech recognizer does not recognize speech uttered by a machine. An audio watermark indicates that the speech was uttered by a machine, and the speech recognizer attenuates the observed speech if it contains the watermark. The watermark is embedded in the high-frequency band so that it is not perceived by humans and can be extracted robustly. Experimental results showed that the proposed method robustly determines whether the watermark is present.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	speech recognition / spoken dialog system / audio watermarking
Paper #	EMM2019-94
Date of Issue	2020-01-20 (EMM)
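
The abstract describes three steps: mark synthesized speech in a high frequency band, detect the mark on the recognizer side, and attenuate the input when the mark is found. The following is a minimal sketch of that flow, not the authors' method. It assumes a single fixed tone as the marker and uses the Goertzel algorithm for detection; the sampling rate, marker frequency, amplitude, threshold, and all function names are hypothetical values chosen for illustration.

```python
import math

# All constants below are illustrative assumptions, not values from the paper.
SAMPLE_RATE = 48000      # samples per second
MARKER_FREQ = 20000.0    # Hz, hypothetical marker tone in the high band
MARKER_AMP = 0.01        # marker amplitude relative to full scale
THRESHOLD = 1e-6         # hypothetical threshold on normalized marker power


def embed_marker(samples):
    """Add a low-level high-frequency tone to flag speech as machine-generated."""
    step = 2.0 * math.pi * MARKER_FREQ / SAMPLE_RATE
    return [s + MARKER_AMP * math.sin(step * n) for n, s in enumerate(samples)]


def marker_power(samples):
    """Normalized signal power at MARKER_FREQ, computed with the Goertzel algorithm."""
    coeff = 2.0 * math.cos(2.0 * math.pi * MARKER_FREQ / SAMPLE_RATE)
    s1 = s2 = 0.0
    for x in samples:
        s0 = x + coeff * s1 - s2
        s2 = s1
        s1 = s0
    power = s1 * s1 + s2 * s2 - coeff * s1 * s2
    return power / (len(samples) ** 2)


def has_marker(samples):
    """Decide whether the marker tone is present in the observed signal."""
    return marker_power(samples) > THRESHOLD


def suppress_if_marked(samples, gain=0.0):
    """Attenuate the observed signal when the marker is detected, else pass it through."""
    if has_marker(samples):
        return [s * gain for s in samples]
    return list(samples)


# Stand-in for speech: 0.1 s of a 200 Hz tone (a real system would use recorded audio).
clean = [0.5 * math.sin(2.0 * math.pi * 200.0 * n / SAMPLE_RATE) for n in range(4800)]
marked = embed_marker(clean)

print("clean detected as machine speech:", has_marker(clean))
print("marked detected as machine speech:", has_marker(marked))
```

A single tone is the simplest possible marker and is easy to remove or spoof; it is used here only to make the embed, detect, and attenuate steps concrete.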

Conference Information
Committee	EMM
Conference Date	2020/1/27 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)	Tohoku Univ.
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Sense of Presence, Universal Media, Digital Entertainment, etc.
Chair	Masaki Kawamura(Yamaguchi Univ.)
Vice Chair	Motoi Iwata(Osaka Prefecture Univ.) / Tetsuya Kojima(NIT,Tokyo College)
Secretary	Motoi Iwata(NIT, Nagano College) / Tetsuya Kojima(Nagase)
Assistant	Masaki Inamura(Tokyo Denki Univ.) / Kazuhiro Kono(Kansai Univ.)

Paper Information
Registration To	Technical Committee on Enriched MultiMedia
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Suppression of Dialog System Speech by Embedding Marker Signal into High Frequency Band
Sub Title (in English)
Keyword(1)	speech recognition
Keyword(2)	spoken dialog system
Keyword(3)	audio watermarking
1st Author's Name	Shunsuke Saga
1st Author's Affiliation	Tohoku University(Tohoku Univ.)
2nd Author's Name	Akinori Ito
2nd Author's Affiliation	Tohoku University(Tohoku Univ.)
Date	2020-01-27
Paper #	EMM2019-94
Volume (vol)	vol.119
Number (no)	EMM-396
Page	pp.1-6(EMM)
#Pages	6
Date of Issue	2020-01-20 (EMM)