Presentation | 2020-03-02 The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of linguistic information but also for that of para- and non- linguistic information. Although spontaneous speech synthesis requires the use of various contexts to express the diversity of prosody in spontaneous speech, it is not clear what features are important. In this study, we utilize the rich tags annotated in Corpus of Spontaneous Japanese (CSJ), and use them as the extended contexts. Experimental evaluation results show that both frequently- and infrequently- observed tags are effective for synthesizing spontaneous speech. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech synthesis / context / spontaneous speech / annotation / deep neural network |
Paper # | EA2019-112,SIP2019-114,SP2019-61 |
Date of Issue | 2020-02-24 (EA, SIP, SP) |
Conference Information | |
Committee | SP / EA / SIP |
---|---|
Conference Date | 2020/3/2(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Okinawa Industry Support Center |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Hisashi Kawai(NICT) / Kenichi Furuya(Oita Univ.) / Naoyuki Aikawa(TUS) |
Vice Chair | Akinobu Ri(Nagoya Inst. of Tech.) / Suehiro Shimauchi(Kanazawa Inst. of Tech.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / Kazunori Hayashi(Osaka City Univ) / Yukihiro Bandou(NTT) |
Secretary | Akinobu Ri(Kyoto Univ.) / Suehiro Shimauchi(Waseda Univ.) / Shigeto Takeoka(NHK) / Kazunori Hayashi(Univ. of Tokyo) / Yukihiro Bandou(Hiroshima Univ.) |
Assistant | Tomoki Koriyama(Univ. of Tokyo) / Yusuke Ijima(NTT) / Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / Kenjiro Sugimoto(Waseda Univ.) |
Paper Information | |
Registration To | Technical Committee on Speech / Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis |
Sub Title (in English) | |
Keyword(1) | speech synthesis |
Keyword(2) | context |
Keyword(3) | spontaneous speech |
Keyword(4) | annotation |
Keyword(5) | deep neural network |
1st Author's Name | Yuki Yamashita |
1st Author's Affiliation | The University of Tokyo(UTokyo) |
2nd Author's Name | Tomoki Koriyama |
2nd Author's Affiliation | The University of Tokyo(UTokyo) |
3rd Author's Name | Yuki Saito |
3rd Author's Affiliation | The University of Tokyo(UTokyo) |
4th Author's Name | Shinnosuke Takamichi |
4th Author's Affiliation | The University of Tokyo(UTokyo) |
5th Author's Name | Yusuke Ijima |
5th Author's Affiliation | NTT Media Intelligence Laboratories(NTT) |
6th Author's Name | Ryo Masumura |
6th Author's Affiliation | NTT Media Intelligence Laboratories(NTT) |
7th Author's Name | Hiroshi Saruwatari |
7th Author's Affiliation | The University of Tokyo(UTokyo) |
Date | 2020-03-02 |
Paper # | EA2019-112,SIP2019-114,SP2019-61 |
Volume (vol) | vol.119 |
Number (no) | EA-439,SIP-440,SP-441 |
Page | pp.pp.65-70(EA), pp.65-70(SIP), pp.65-70(SP), |
#Pages | 6 |
Date of Issue | 2020-02-24 (EA, SIP, SP) |