Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 13:55 |
Tokyo |
NHK Science & Technology Research Labs. |
[Poster Presentation]
Synthetic speech-based sound masking for privacy protection when speaking to smartphones in public space Takahiro Tsugui, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-38 |
In this paper, we propose a synthetic speech-based sound masking method that protects the privacy when speaking to smart... [more] |
SP2019-38 pp.55-60 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 16:00 |
Tokyo |
NHK Science & Technology Research Labs. |
A comparison of neural vocoders in singing voice synthesis Sota Wada, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-42 |
In this study, we compare five types of vocoders based on neural networks (neural vocoders) for singing voice synthesis.... [more] |
SP2019-42 pp.85-90 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Initial analysis of emotional speech acted in noise Yi Zhao (NII), Atsushi Ando (NTT), Shinji Takaki, Junichi Yamagishi (NII), Satoshi Kobashikawa (NTT) EA2018-120 SIP2018-126 SP2018-82 |
Speakers usually adjust their way of talking in noisy environments involuntarily for effective communication, this adapt... [more] |
EA2018-120 SIP2018-126 SP2018-82 pp.125-130 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
CWT spectral loss for training a DNN-based speech waveform model Shinji Takaki (NII), Hirokazu Kameoka (NTT), Junichi Yamagishi (NII) EA2018-121 SIP2018-127 SP2018-83 |
[more] |
EA2018-121 SIP2018-127 SP2018-83 pp.131-135 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Use and evaluation of Tacotron and context features in rakugo speech synthesis Shuhei Kato (SOKENDAI/NII), Shinji Takaki, Junichi Yamagishi (NII), Yusuke Yasuda (SOKENDAI/NII), Xin Wang (NII) EA2018-126 SIP2018-132 SP2018-88 |
We have been working on constructing rakugo (a traditional Japanese verbal entertainment) speech synthesis toward speech... [more] |
EA2018-126 SIP2018-132 SP2018-88 pp.161-166 |
SP |
2017-08-30 10:25 |
Kyoto |
Kyoto Univ. |
Autoregressive quantized F0 modeling using a recurrent neural network with feedback links Xin Wang, Shinji Takaki, Junichi Yamagishi (NII) SP2017-21 |
[more] |
SP2017-21 pp.7-12 |
PRMU, SP |
2017-06-22 14:45 |
Miyagi |
|
Postfiltering of STFT Spectrograms Based on Generative Adversarial Networks Takuhiro Kaneko (NTT), Shinji Takaki (NII), Hirokazu Kameoka (NTT), Junichi Yamagishi (NII) PRMU2017-28 SP2017-4 |
This paper presents postfiltering of short-term Fourier transform (STFT) spectrograms based on Generative Adversarial Ne... [more] |
PRMU2017-28 SP2017-4 pp.17-22 |
SP |
2017-01-21 13:00 |
Tokyo |
The University of Tokyo |
[Invited Talk]
Interesting! Deep learning for text-to-speech synthesis Shinji Takaki (NII) SP2016-71 |
(To be available after the conference date) [more] |
SP2016-71 pp.41-46 |
SP |
2016-10-27 16:25 |
Shizuoka |
Shizuoka University. |
A DNN-based Text-to-Speech Synthesis System using Speaker, Gender and Age Codes Hieu Thi Luong (VNU - HCM - University of Science), Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII) SP2016-48 |
(To be available after the conference date) [more] |
SP2016-48 pp.37-42 |
SP |
2016-10-27 16:50 |
Shizuoka |
Shizuoka University. |
Investigating the impact of a neural network's depth on spectral and F0 modelling for parametric speech synthesis Xin Wang (SOKENDAI), Shinji Takaki, Junichi Yamagishi (NII) SP2016-49 |
[more] |
SP2016-49 pp.43-48 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-03 09:25 |
Aichi |
Nagoya Inst of Tech. |
Deep Auto-encoder based Low-dimensional Feature Extraction using FFT Spectral Envelopes in Statistical Parametric Speech Synthesis Shinji Takaki, Junichi Yamagishi (NII) SP2015-81 |
In the state-of-the-art statistical parametric speech synthesis system, a speech analysis module, e.g. STRAIGHT spectral... [more] |
SP2015-81 pp.99-104 |
SP, IPSJ-SLP (Joint) |
2015-07-17 09:30 |
Nagano |
Katakura Suwako Hotel |
Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII), JongJin Kim (Naver Labs) SP2015-44 |
In this paper, we investigate a combination of several feed-forward deep neural networks (DNNs) for a high-quality stati... [more] |
SP2015-44 pp.49-54 |
PRMU |
2013-02-22 09:30 |
Osaka |
|
Extended separable lattice HMMs based on state duration control for recognition of images with variations Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2012-164 |
In this paper, an extension of separable lattice HMMs is described that (SL-HMM) introduces state duration control for d... [more] |
PRMU2012-164 pp.149-154 |