Presentation 2015-12-02
Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification
Shinya Kura, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Several attempts have been made to correct the durational patterns of non-native speech for language learning. A typical approach modifies a speech parameter sequence with Dynamic Time Warping (DTW), using native speech as the reference, and generates corrected speech from the modified sequence. Although this approach makes it possible to flexibly modify the durational patterns of non-native speech, the quality of the corrected speech degrades significantly because an analysis-synthesis process is used to generate it. In this report, we propose a method that corrects durational patterns by applying the DTW-based temporal warping directly to the waveform. In calculating the temporal warping function, statistical voice conversion is effectively used to reduce the adverse effect of speaker differences. Moreover, phoneme insertion, which is often observed in non-native speech, is also handled. We conducted an experimental evaluation using English speech read by Japanese speakers, demonstrating that the proposed method can flexibly modify durational patterns while avoiding the quality degradation caused by the analysis-synthesis process. Furthermore, waveform segments suffering from quality degradation caused by temporal warping were analyzed using the modulation spectrum of spectral parameters.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) non-native speech / correction of durational patterns / dynamic time warping / waveform modification / modulation spectrum
Paper # SP2015-73
Date of Issue 2015-11-25 (SP)

Conference Information
Committee NLC / IPSJ-NL / SP / IPSJ-SLP
Conference Date 2015/12/2 (3 days)
Place (in Japanese) (See Japanese page)
Place (in English) Nagoya Inst. of Tech.
Topics (in Japanese) (See Japanese page)
Topics (in English) The Second Natural Language Processing Symposium & The 17th Spoken Language Symposium
Chair Koichi Takeuchi(Okayama Univ.) / Kentaro Inui(Tohoku Univ.) / Kazunori Mano(Shibaura Inst. of Tech.) / Koichi Shinoda(Tokyo Inst. of Tech.)
Vice Chair Hiroshi Kanayama(IBM) / Makoto Ichise(NTT DoCoMo) / / Norihide Kitaoka(Tokushima Univ.)
Secretary Hiroshi Kanayama(Univ. of Tokyo/Hottolink) / Makoto Ichise(Ryukoku Univ.) / (Osaka Univ.) / Norihide Kitaoka(Tohoku Univ.) / (Mixi Co. Ltd.)
Assistant Kazutaka Shimada(Kyushu Inst. of Tech.) / Ryuichiro Higashinaka(NTT) / / Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT)

Paper Information
Registration To Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification
Sub Title (in English)
Keyword(1) non-native speech
Keyword(2) correction of durational patterns
Keyword(3) dynamic time warping
Keyword(4) waveform modification
Keyword(5) modulation spectrum
1st Author's Name Shinya Kura
1st Author's Affiliation Nara Institute of Science and Technology(NAIST)
2nd Author's Name Shinnosuke Takamichi
2nd Author's Affiliation Nara Institute of Science and Technology(NAIST)
3rd Author's Name Tomoki Toda
3rd Author's Affiliation Nara Institute of Science and Technology/Nagoya University(NAIST/Nagoya Univ.)
4th Author's Name Graham Neubig
4th Author's Affiliation Nara Institute of Science and Technology(NAIST)
5th Author's Name Sakriani Sakti
5th Author's Affiliation Nara Institute of Science and Technology(NAIST)
6th Author's Name Satoshi Nakamura
6th Author's Affiliation Nara Institute of Science and Technology(NAIST)
Date 2015-12-02
Paper # SP2015-73
Volume (vol) vol.115
Number (no) SP-346
Page pp.19-24(SP)
#Pages 6
Date of Issue 2015-11-25 (SP)