Presentation | 2015-12-02 Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification Shinya Kura, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | There are several attempts at correcting durational patterns of non-native speech towards language learning. One of the typical approaches modifies a speech parameter sequence with Dynamic Time Warping (DTW) using native speech as the reference, generating corrected speech from the modified speech parameter sequence. Although this approach makes it possible to flexibly modify durational patterns of non-native speech, quality of the corrected speech significantly degrades due to the use of analysis-synthesis process to generate the corrected speech. In this report, we propose a method for correcting durational patterns using direct waveform modification for performing DTW. In calculating a temporal warping function, statistical voice conversion is effectively used to reduce an adverse effect caused by speaker differences. Moreover, phoneme insertion often observed in non-native speech is also handled. We conducted an experimental evaluation using English speech read by Japanese, demonstrating that the proposed method was capable of flexibly modifying durational patterns while avoiding quality degradation caused by the analysis-synthesis process. Furthermore, waveform segments suffering from quality degradation caused by temporal warping was analyzed using the modulation spectrum of spectral parameters. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | non-native speech / correction of durational patterns / dynamic time warping / waveform modification / modulation spectrum |
Paper # | SP2015-73 |
Date of Issue | 2015-11-25 (SP) |
Conference Information | |
Committee | NLC / IPSJ-NL / SP / IPSJ-SLP |
---|---|
Conference Date | 2015/12/2(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Nagoya Inst of Tech. |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | The Second Natural Language Processing Symposium & The 17th Spoken Language Symposium |
Chair | Koichi Takeuchi(Okayama Univ.) / Kentaro Inui(Tohoku Univ.) / Kazunori Mano(Shibaura Inst. of Tech.) / Koichi Shinoda(東工大) |
Vice Chair | Hiroshi Kanayama(IBM) / Makoto Ichise(NTT DoCoMo) / / Norihide Kitaoka(Tokushima Univ.) |
Secretary | Hiroshi Kanayama(Univ. of Tokyo/Hottolink) / Makoto Ichise(Ryukoku Univ.) / (Osaka Univ.) / Norihide Kitaoka(Tohoku Univ.) / (Mixi Co. Ltd.) |
Assistant | Kazutaka Shimada(Kyushu Inst. of Tech.) / Ryuichiro Higashinaka(NTT) / / Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT) |
Paper Information | |
Registration To | Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification |
Sub Title (in English) | |
Keyword(1) | non-native speech |
Keyword(2) | correction of durational patterns |
Keyword(3) | dynamic time warping |
Keyword(4) | waveform modification |
Keyword(5) | modulation spectrum |
1st Author's Name | Shinya Kura |
1st Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
2nd Author's Name | Shinnosuke Takamichi |
2nd Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
3rd Author's Name | Tomoki Toda |
3rd Author's Affiliation | Nara Institute of Science and Technology/Nagoya University(NAIST/Nagoya Univ.) |
4th Author's Name | Graham Neubig |
4th Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
5th Author's Name | Sakriani Sakti |
5th Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
6th Author's Name | Satoshi Nakamura |
6th Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
Date | 2015-12-02 |
Paper # | SP2015-73 |
Volume (vol) | vol.115 |
Number (no) | SP-346 |
Page | pp.pp.19-24(SP), |
#Pages | 6 |
Date of Issue | 2015-11-25 (SP) |