Presentation 2020-03-02
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of linguistic information but also for that of para- and non- linguistic information. Although spontaneous speech synthesis requires the use of various contexts to express the diversity of prosody in spontaneous speech, it is not clear what features are important. In this study, we utilize the rich tags annotated in Corpus of Spontaneous Japanese (CSJ), and use them as the extended contexts. Experimental evaluation results show that both frequently- and infrequently- observed tags are effective for synthesizing spontaneous speech.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech synthesis / context / spontaneous speech / annotation / deep neural network
Paper # EA2019-112,SIP2019-114,SP2019-61
Date of Issue 2020-02-24 (EA, SIP, SP)

Conference Information
Committee SP / EA / SIP
Conference Date 2020/3/2(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Industry Support Center
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Hisashi Kawai(NICT) / Kenichi Furuya(Oita Univ.) / Naoyuki Aikawa(TUS)
Vice Chair Akinobu Ri(Nagoya Inst. of Tech.) / Suehiro Shimauchi(Kanazawa Inst. of Tech.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / Kazunori Hayashi(Osaka City Univ) / Yukihiro Bandou(NTT)
Secretary Akinobu Ri(Kyoto Univ.) / Suehiro Shimauchi(Waseda Univ.) / Shigeto Takeoka(NHK) / Kazunori Hayashi(Univ. of Tokyo) / Yukihiro Bandou(Hiroshima Univ.)
Assistant Tomoki Koriyama(Univ. of Tokyo) / Yusuke Ijima(NTT) / Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / Kenjiro Sugimoto(Waseda Univ.)

Paper Information
Registration To Technical Committee on Speech / Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis
Sub Title (in English)
Keyword(1) speech synthesis
Keyword(2) context
Keyword(3) spontaneous speech
Keyword(4) annotation
Keyword(5) deep neural network
1st Author's Name Yuki Yamashita
1st Author's Affiliation The University of Tokyo(UTokyo)
2nd Author's Name Tomoki Koriyama
2nd Author's Affiliation The University of Tokyo(UTokyo)
3rd Author's Name Yuki Saito
3rd Author's Affiliation The University of Tokyo(UTokyo)
4th Author's Name Shinnosuke Takamichi
4th Author's Affiliation The University of Tokyo(UTokyo)
5th Author's Name Yusuke Ijima
5th Author's Affiliation NTT Media Intelligence Laboratories(NTT)
6th Author's Name Ryo Masumura
6th Author's Affiliation NTT Media Intelligence Laboratories(NTT)
7th Author's Name Hiroshi Saruwatari
7th Author's Affiliation The University of Tokyo(UTokyo)
Date 2020-03-02
Paper # EA2019-112,SIP2019-114,SP2019-61
Volume (vol) vol.119
Number (no) EA-439,SIP-440,SP-441
Page pp.pp.65-70(EA), pp.65-70(SIP), pp.65-70(SP),
#Pages 6
Date of Issue 2020-02-24 (EA, SIP, SP)