［ポスター講演］Anti-spoofingに敵対するDNN音声変換の評価

齋藤 佑樹; 高道 慎之介; 猿渡 洋

講演名	2017-01-21 ［ポスター講演］Anti-spoofingに敵対するDNN音声変換の評価齋藤佑樹(東大), 高道慎之介(東大), 猿渡洋(東大),
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	統計的パラメトリック音声合成において，生成される合成音声の音質劣化は深刻な問題となる．これまでに我々はテキスト音声合成において，合成音声による声のなりすましを防ぐ技術である anti-spoofing に敵対する音響モデル学習法 (敵対的 DNN 音声合成) を提案し，有効性を示している．本稿では，敵対的 DNN 音声合成の枠組みを音声変換へ適用し，高音質な音声変換を実現するための DNN 音響モデルの学習アルゴリズムを提案する．実験的評価により，(1) Feed-Forward 型ネットワークを用いた特徴量変換に基づく DNN 音声変換，及び，本稿で新たに提案する，(2) highway network を用いた差分スペクトル推定に基づく DNN 音声変換の両方において提案アルゴリズムによる音質改善効果が得られることを示す．
抄録(英)	This paper proposes a novel training algorithm for high-quality Deep Neural Network (DNN)-based voice conversion. To improve speech quality in DNN-based text-to-speech synthesis, we have proposed a training algorithm to deceive anti-spoofing verification, called adversarial DNN-based speech synthesis. The anti-spoofing is a discriminator to distinguish natural and synthetic speech. This paper extends this idea to DNN-based voice conversion, and we build the acoustic models that can deceive the anti-spoofing verification. To evaluate the proposed algorithm, we conduct evaluations using two conversion frameworks: speech feature conversion using Feed-Forward neural networks and spectral differentials estimation using highway networks from input to output, which is proposed in this paper. The evaluation results successfully demonstrate the speech-quality improvements for both frameworks.
キーワード(和)	DNN音声変換 / anti-spoofing / 敵対的DNN音声合成 / highway network / 差分スペクトル / 過剰な平滑化
キーワード(英)	DNN-based voice conversion / anti-spoofing verification / adversarial DNN-based speech synthesis / highway networks / spectral differentials / over-smoothing
資料番号	SP2016-69
発行日	2017-01-14 (SP)

研究会情報
研究会	SP
開催期間	2017/1/21(から1日開催)
開催地（和）	東京大学
開催地（英）	The University of Tokyo
テーマ（和）	合成，生成，韻律，音声一般
テーマ（英）	Synthesis, Generation, Prosody, etc.
委員長氏名（和）	間野一則(芝浦工大)
委員長氏名（英）	Kazunori Mano(Shibaura Inst. of Tech.)
副委員長氏名（和）	森大毅(宇都宮大)
副委員長氏名（英）	Hiroki Mori(Utsunomiya Univ.)
幹事氏名（和）	滝口哲也(神戸大) / 西田昌史(静岡大)
幹事氏名（英）	Tetsuya Takiguchi(Kobe Univ.) / Masafumi Nishida(Shizuoka Univ.)
幹事補佐氏名（和）	浅見太一(NTT) / 橋本佳(名工大)
幹事補佐氏名（英）	Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.)

講演論文情報詳細
申込み研究会	Technical Committee on Speech
本文の言語	JPN
タイトル（和）	［ポスター講演］Anti-spoofingに敵対するDNN音声変換の評価
サブタイトル（和）
タイトル（英）	[Poster Presentation] Evaluation of DNN-Based Voice Conversion Deceiving Anti-spoofing Verification
サブタイトル（和）
キーワード(1)（和/英）	DNN音声変換 / DNN-based voice conversion
キーワード(2)（和/英）	anti-spoofing / anti-spoofing verification
キーワード(3)（和/英）	敵対的DNN音声合成 / adversarial DNN-based speech synthesis
キーワード(4)（和/英）	highway network / highway networks
キーワード(5)（和/英）	差分スペクトル / spectral differentials
キーワード(6)（和/英）	過剰な平滑化 / over-smoothing
第 1 著者氏名（和/英）	齋藤佑樹 / Yuki Saito
第 1 著者所属（和/英）	東京大学(略称：東大) The University of Tokyo(略称：UT)
第 2 著者氏名（和/英）	高道慎之介 / Shinnosuke Takamichi
第 2 著者所属（和/英）	東京大学(略称：東大) The University of Tokyo(略称：UT)
第 3 著者氏名（和/英）	猿渡洋 / Hiroshi Saruwatari
第 3 著者所属（和/英）	東京大学(略称：東大) The University of Tokyo(略称：UT)
発表年月日	2017-01-21
資料番号	SP2016-69
巻番号（vol）	vol.116
号番号（no）	SP-414
ページ範囲	pp.29-34(SP),
ページ数	6
発行日	2017-01-14 (SP)